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1 
Introduction 


1.1 The physics of the LHC era 
1.1.1 Particle physics in the LHC era 


The turn-on of the LHC in 2008 culminated an almost 20-year design and construction 
effort, resulting in the largest particle accelerator (actually the largest machine) ever 
built. At its inception a competition still existed with the TEVATRON which, although 
operating at a much lower energy, had a data sample with a large integrated lumi- 
nosity and well-understood detectors and physics-analysis software. The TEVATRON 
had discovered the top quark and was continuing its search for the Higgs boson. As is 
well known, the LHC suffered considerable damage from a cryogenic quench soon after 
turn-on that resulted in a shut-down for about 1.5 years. Its (re)turn-on in 2010 was 
at a much lower energy (7 TeV rather than 14 TeV) and at much lower intensities. The 
small data sample at the lower energy can be considered in retrospect as a blessing in 
disguise. There was not enough data to even consider a search for the Higgs boson (or 
even for much in the way of new physics), but there was enough data to produce W 
and Z bosons, top quarks, photons, leptons and jets — in other words, all of the parti- 
cles of the Standard Model except for the Higgs boson. The result was the re-discovery 
of the Standard Model (a coinage for which one of the authors takes credit) and the 
development of the analysis tools and the detailed understanding of the detectors that 
allowed for the discovery of the Higgs boson on July 4, 2012, with data from 7 TeV 
in 2011 and 8 TeV in 2012. The LHC turned off again in early 2013 for repairs and 
upgrades (to avoid the type of catastrophic quench that occurred in 2008). The LHC 
detectors also used this two-year period for repairs and upgrades. The LHC ran again 
in 2015, at an energy much closer to design (13 TeV). The increased energy allowed for 
more detailed studies of the Higgs boson, but more importantly offered a much greater 
reach for the discovery of possible new physics. At the time of completion of this book, 
a great deal of physics has been measured at the operating energy of 13 TeV. Given 
the new results continually pouring out at this new energy, the decision was made to 
concentrate in this book on results from 7 and 8 TeV running. This is sufficient for 
the data comparisons needed to illustrate the theoretical machinery developed here. 
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1.1.2 The quest for the Higgs boson — and beyond 
1.1.2.1 Finding the Higgs boson 


The LHC was designed as a discovery machine, with a design centre-of-mass energy a 
factor of seven larger than that of the TEVATRON. This higher collision energy opened 
up a wide phase space for searches for new physics, but there was one discovery that 
the LHC was guaranteed to make; that of the Higgs boson, or an equivalent mechanism 
for preventing WW scattering from violating unitarity at high masses. 

The Higgs boson couples directly to quarks, leptons and to W and Z bosons, and 
indirectly (through loops) to photons and gluons. Thus the Higgs boson final states 
are just the building blocks of the SM with which we have much experience, both at 
the TEVATRON and the LHC. The ATLAS and CMS detectors were designed to find the 
Higgs boson and to measure its properties in detail. 

The cross-section for production of a Higgs boson is not small. However, the final 
states for which the Higgs boson branching ratio is large (such as bb) have backgrounds 
which are much larger from other more common processes. The final states with low 
backgrounds (such as 7Z* — €té~¢+£—) suffer from poor statistics, primarily due 
to the Z branching ratio to leptons. The Higgs— yy final state suffers from a small 
branching ratio and a large SM background. Thus one might not expect this final state 
to be promising for a Higgs boson search. However, due to the intrinsic narrow width 
of the Higgs boson, a diphoton signal can be observable if the experimental resolution 
of the detector is good enough that the signal stands out over the background. 

The measurable final states of the Higgs boson decays were further subdivided into 
different topologies so that optimized cuts could be used to improve on the signal- 
to-background ratio for each topology (for example, in ATLAS the diphoton channel 
was divided into 12 topologies). The extracted signal was further weighted by the 
expectations of the SM Higgs boson in those topologies. In this sense, the Higgs boson 
that was discovered in 2012 was indeed the Standard Model Higgs boson. However, as 
will be discussed in Chapter 9, detailed studies have determined the properties of the 
new particle to be consistent with this assumption. 


1.1.2.2 The triumph of the Gauge Principle 


The discovery of the Higgs boson by the ATLAS and CMS collaboration, reported in 
July 2012 and published in [15, 368], is undoubtedly the crowning achievement of the 
LHC endeavour so far. It is hard to overestimate the importance of this discovery for 
the field of particle physics and beyond. 

The Higgs boson is the only fundamental scalar particle ever found, which in itself 
makes it unique; all other scalars up to now were bound states, and the fundamental 
particles found so far have been all either spin-1/2 fermions or spin-1 vector bosons. 
This discovery is even more significant as it marks a triumph of the human mind: the 
Higgs boson is the predicted visible manifestation of the Brout—Englert-Higgs (BEH) 
mechanism [516, 601, 619-621, 675], which allows the generation of particle masses in 
a gauge-invariant way [580, 835, 888]. Ultimately, this discovery proves the paradigm 
of gauge invariance as the governing principle of the sub-nuclear world at the smallest 
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distances and largest energies tested in a laboratory so far. With this discovery a 
50-year-old prediction concerning the character of nature has been proven 

The question now is not whether the Higgs boson exists but instead what are 
its properties? Is the Higgs boson perhaps a portal to some new phenomena, new 
particles, or even new dynamics? There are some hints from theory and cosmology 
that the discovery of the Higgs boson is not the final leg of the journey. 


1.1.2.3 Beyond the Standard Model 


By finding the last missing particle and thereby completing the most accurate and 
precise theory of nature at the sub-nuclear ever constructed, the paradigms by which 
it has been constructed have proved overwhelmingly successful. Despite this there are 
still fundamental questions left unanswered. These questions go beyond the realm of 
the SM, but they remain of utmost importance for an even deeper understanding of 
the world around us. 

Observations of matter — Earth, other planets in the Solar System or beyond, 
other stars, or galaxies — suggest that the symmetry between matter and anti-matter 
is broken. This is a universe filled by matter and practically devoid of anti-matter. 
While naively there is no obvious reason why one should be preferred over the other, 
at some point in the history of the Universe — and presumably very early — this 
asymmetry had to emerge from what is believed to have been a symmetric initial state. 
In order for this to happen, a set of conditions, the famous Sakharov conditions [710, 
834] had to be met. One of these intricate conditions is the violation of CP, which 
demands that the symmetry under the combined parity and charge-conjugation (CP) 
transformation must be broken. Experimentally, the existence of CP violation has 
been confirmed and is tightly related to the existence of at least three generations 
of matter fields in the SM. Due to the BEH mechanism, particles acquire masses, 
and their mass and electroweak interaction eigenstates are no longer aligned after 
EWSB. The existence of a complex phase in the CKM matrix, which parametrizes the 
interrelation between these two set of eigenstates, ultimately triggers CP violation in 
the quark sector. However, the amount of CP violation established is substantially 
smaller than necessary to explain how the universe evolved from an initial symmetric 
configuration to the matter-dominated configuration seen today [358]. 

Likewise, the existence of dark matter (DM) is now well established, first evidenced 
by the rotational curves of galaxies [831]. DM denotes matter which interacts only very 
weakly with normal matter (described by the SM) and therefore certainly does not 
interact through electromagnetism or the strong nuclear force. Despite numerous at- 
tempts it has not been directly detected. DM interacts through gravity and thereby 
has influenced the formation of large-scale structures in the Universe. Cosmological 
precision measurements by the WMAP and PLANCK collaborations [125, 623, 862] con- 
clude that dark matter provides about 80% of the total matter content of the Universe. 
This in turn contributes about 25% of the overall energy balance, with the rest of the 
energy content of the Universe provided by what is known as dark energy (DE), which 
is even more mysterious than DM. The only thing known is that the interplay of DM 
and DE has been crucial in shaping the Universe as observed today and will continue 
to determine its future. One possible avenue in searches for DM particles at collider ex- 


4 Introduction 


periments is that they have no coupling to ordinary matter through gauge interactions 
but instead couple through the Higgs boson. 

These examples indicate that the SM, as beautiful as it is, will definitely not provide 
the ultimate answer to the questions concerning the fundamental building blocks of the 
world around us and how they interact at the shortest distances. The SM will have to be 
extended by a theory encompassing at least enhanced CP violation, dark matter, and 
dark energy. Any such extension is already severely constrained by the overwhelming 
success of the gauge principle: the gauge sector of the SM has been scrutinized to 
incredibly high precision, passing every test up to now with flying colours. See for 
example [179] for a recent review, combining data from e~et and hadron collider 
experiments. The Higgs boson has been found only recently, and it is evident that this 
discovery and its implications will continue to shape our understanding of the micro- 
world around us. The discovery itself, and even more so the mass of the new particle 
and our first, imprecise measurements of its properties, already rule out or place severe 
constraints on many new physics models going beyond the well-established SM [515]. 

Right now, we are merely at the beginning of an extensive programme of precision 
tests in the Higgs sector of the SM or the theory that may reveal itself beyond it. It 
can be anticipated that at the end of the LHC era, either the SM will have prevailed 
completely, with new physics effects and their manifestation as new particles possibly 
beyond direct human reach, or alternatively, we will have forged a new, even more 
beautiful model of particle physics. 


1.1.3 LHC: Accelerator and detectors 
1.1.3.1 LHC, the machine 


The LHC not only is the world’s largest particle accelerator but it is also the world’s 
largest machine, at 27 km in circumference. The LHC is a proton-proton collider (al- 
though it also operates with collisions of protons on nuclei, and nuclei on nuclei), 
located approximately 100 m underground and straddling the border between France 
and Switzerland. The LHC occupies the tunnel formerly used for the LEP accelerator 
in which electrons and positrons collided at centre-of-mass energies up to 209 GeV. 
The LHC contains 9593 magnets, including 1232 superconducting dipole magnets, ca- 
pable of producing magnetic fields of the order of 8.3 T, and a maximum proton beam 
energy of 7 TeV (trillion electron-volts), leading to a maximum collision energy of 14 
TeV. Thus far, the LHC has run at collision energies of 7 TeV (2010, 2011), 8 TeV 
(2012) and 13 TeV (2015,2016), greatly exceeding the previous record of the Fermi- 
lab TEVATRON of 1.96 TeV.! The large radius of the LHC is necessitated because of 
the desire to reach as high a beam energy as possible (7 TeV) using dipoles with the 
largest magnetic fields possible (in an accelerator). Running at full energy, the power 
consumption (including the experiments) is 750 GWh per year. At full power, the LHC 
will collide 2808 proton bunches, each approximately 30 cm long and 16 microns in 
diameter and containing 1.15 x 10" protons, leading to a luminosity of 10°4+cm~?/s 
and a billion proton-proton collisions per second. The spacing between the bunches is 
25 ns leading to collisions occurring every 25 ns; thus, at full luminosity there will 


1 Unlike the LHC, the TEVATRON was a proton-antiproton collider. 
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Overall view of the LHC experiments. 


Fig. 1.1 A 3D layout of the LHC, showing the location of the four major 
experiments. Reprinted with permission from CERN. 


be on average 25 interactions every beam crossing, most of which will be relatively 
uninteresting. The high luminosity for the machine is needed to produce events from 
processes with small cross-sections, for example involving physics at the TeV scale. 

There are seven experiments running at the LHC (ATLAS, CMs, LHCB, ALICE, 
TOTEM, LHCf and MoEDAL), with ATLAS and CMS being the two general-purpose 
detectors. A schematic drawing of the LHC, indicating the position of the four larger 
experiments is shown in Fig. 1.1. 


1.1.3.2 The detectors 


It seems paradoxical that the largest devices are needed to probe the smallest distance 
scales. The ATLAS detector, for example, is 46 m long, 25 m in diameter and weighs 
7000 tonnes. The CMS detector, although smaller than ATLAS at 15 m in diameter 
and 21.5 m in length, is twice as massive, at 14,000 tonnes. This can be compared 
to the CDF detector at the TEVATRON which was only 12mx12mx12m (and 5000 
tonnes). The key to the size and complexity of the LHC detectors is the need to 
measure the four-vectors of the large number of particles present in LHC events, whose 
momenta can extend to the TeV range. The large particle multiplicity requires very 
fine segmentation; the ATLAS detector, for example, has 160 million channels to read 
out, half of which are in the pixel detector. The large energies/momenta require, in 
addition to fine segmentation, large magnetic fields and tracking volumes and thick 
calorimetry. 

Both ATLAS and CMS are what are known as general-purpose 47 detectors, meaning 
that they attempt to cover as much of the solid angle around the collision point as 
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possible, in order to reconstruct as much information about each event as possible.” 
There is a universal cylindrically symmetric configuration for a 4r detector, embodied, 
for example, in the ATLAS detector, as shown in Fig. 1.2. Collisions take place in the 
centre of the detector. Particles produced in each collision first encounter the pixel 
detector (6) and the silicon tracking detector (5). The first layer of the pixel detector 
is actually mounted on the beam-pipe in order to be as close to the interaction point 
as possible. The beam-pipe itself, in the interaction region, is composed of beryllium in 
order to present as little material as possible to the particles produced in the collision. 
The proximity of the pixel and silicon detectors to the collision point and the very fine 
segmentation (50x400 um for the pixel detector and 70 um for the silicon detector) 
allow for the reconstruction of secondary vertices from bottom and charm particles, 
which can travel distances of a few mm from the interaction point before decaying. 
The next tracking device (4), the transition radiation detector, is a straw-tube detector 
that provides information not only on the trajectory of the charged particle but also 
on the likelihood of the particle being an electron. All three tracking devices sit inside 
the central magnetic field of 2T produced by the solenoid (3). 

The energies of the particles produced in the collision (both neutral and charged) 
are measured by the ATLAS calorimeters, the lead-liquid argon electromagnetic calorime- 
ter (7) and the iron-scintillator hadronic calorimeter (Tilecal) (8). Both the ATLAS and 
CMS electromagnetic calorimeter designs emphasized good resolution for the measure- 
ment of the energies of photons and electrons, primarily to be able to distinguish the 
Higgs boson to yy signal from the much larger diphoton background. The width of a 
light Higgs boson is much less than the experimental resolution, so any improvement 
in the resolution will lead to a better discrimination over the background. 

Energetic muons can pass through the calorimetry, while other particles are ab- 
sorbed. The toroidal magnets (2), in both the central and forward regions, produce an 
additional magnetic field (4 T) in which a second measurement of the muon momen- 
tum can be carried out using the muon tracking chambers (1), using several different 
technologies. One of the unique characteristics of the ATLAS detector (and part of its 
acronym) is the presence of the air-core toroidal muon system. The relatively small 
amount of material in the tracking volume leads to less multiple scattering and thus 
a more precise measurement of the muon’s momentum. The muon momentum can be 
measured to a precision of 10% at a transverse momentum value of 1 TeV. 


1.1.3.3 Challenges 


To use a popular analogy, sampling the physics at the LHC is similar to trying to drink 
from a fire hose. Over 1 billion proton-proton collisions occur each second, but the 
limit of practical data storage is on the order of hundreds of events per second only. 
Thus, the experimental triggers have to provide a reduction capability of a factor of the 
order of 107, while still recording bread-and-butter signatures such as W and Z boson 
production. This requires a high level of sophistication for the on-detector hardware 
triggers and access to large computing resources for the higher-level triggering. Timing 


?The main limitation for the solid-angle coverage is in the forward/backward directions, where the 
instrumentation is cut off by the presence of the beam pipe. 
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Fig. 1.2 A layout of the ATLAS detector, showing the major detector 
components, from en.wikipedia. org/wiki/ATLAS_experiment. Original 
image from CERN. Reprinted with permission from CERN. 


is also an important issue. The ATLAS detector is 25m in diameter. With a bunch- 
crossing time of 25ns, this means that as new interactions are occurring in one bunch 
crossing, the particles from the previous bunch crossing are still passing through the 
detector. Each crossing produces 25 interactions. Experimental analyses thus face both 
in-time pileup and out-of-time pileup. The latter can be largely controlled through the 
readout electronics (modulo substantial variations in the population of the individual 
bunches), while the former requires sophisticated treatment in the physics analyses. 

The dynamic ranges at the LHC are larger than at the TEVATRON. Leptons from 
W boson decays on the order of tens of GeV are still important, but so are multi-TeV 
leptons. Precise calibration and the maintenance of linearity are both crucial. To some 
extent, the TEVATRON has served as a boot camp, providing a learning experience for 
physics at the LHC, albeit at lower energies and intensities. Coming later, the LHC has 
benefited from advances in electronics, in computing, and perhaps most importantly, 
in physics analysis tools. The latter comprise both tools for theoretical predictions at 
higher orders in perturbative QCD and tools for the simulation of LHC final states. 

Despite the difficulties, the LHC has had great success during its initial running, 
culminating in the discovery of the Higgs boson, but, alas, not in the discovery of new 
physics. The results obtained so far comprise a small fraction of the total data taking 
planned for the LHC. New physics may be found with this much larger data sample, 
but discovering it may require precise knowledge of SM physics, including QCD. 


1.2 About this book 


The reader is assumed to be already familiar with textbook methods for the calcula- 
tion of simple Feynman diagrams at tree level, the evaluation of cross-sections through 
phase-space integration with analytic terms, and the ideas underlying the regulariza- 
tion and renormalization of ultraviolet divergent theories; however, for a short review, 
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readers are referred to Appendix B.1, and for a more pedagogical introduction to these 
issues to a wealth of outstanding textbooks on various levels, including the books by 
Peskin and Schréder [803], Halzen and Martin [606], Ramond [822], Field [525] and 
others. For a review of QCD at collider experiments, the reader is referred to the 
excellent books by Ellis, Stirling, and Webber [504] and by Dissertori, Knowles, and 
Schmelling [467]. Of course, for a real understanding of various aspects it is hard to 
beat the original literature, and readers are encouraged to use the references in this 
book as a starting point for their journey through particle physics. 

This book aims to provide an intuitive approach as to how to apply the framework 
of perturbative theory in the context of the strong interaction towards predictions at 
the LHC and ultimately towards an understanding of the signals and backgrounds at 
the LHC. Thus, even without the background discussed at the beginning of this section, 
this book should be useful for anyone wishing for a better understanding of QCD at 
the LHC. 

The ideas for this book have been developed over various lecture series given at 
graduate level lectures or at advanced schools on high-energy physics by the authors. 
The authors hope that this book turns out to be useful in supporting the self-study 
of young researchers in particle physics at the beginning of their career as well as 
more advanced researchers as a resource for their actual research and as material for 
a graduate course on high-energy physics. 


1.2.1 Contents 


Chapter 2 provides a first overview of the content of this book and aims at putting 
various techniques and ideas into some coherent perspective. First of all, a physical 
picture underlying hadronic interactions, and especially scattering reactions at hadron 
colliders, is developed. To arrive at this picture, the ideas underlying the all-important 
factorization formalism are introduced which, in the end, allows the use of perturbative 
concepts in the discussion of the strong interaction at high energies and the calculation 
of cross-sections and other related observables. These concepts are then used in a 
specific example, namely the inclusive production of W bosons at hadron colliders. 
There, their production cross-section is calculated at leading and at next-to-leading 
order in the strong coupling constant, thereby reminding the reader of the ingredients 
of such calculations and fixing the notation and conventions used in this book. This 
part also includes a first discussion of observables relevant for the phenomenology of 
strong interactions at hadron colliders. In addition, some generic features and issues 
related to such fixed-order calculations are sketched. In a second part, the perturbative 
concepts already employed in the fixed-order calculations are extended to also include 
dominant terms to all orders through the resummation formalism. Generic features of 
analytical resummation are introduced there and some first practical applications for 
W production at hadron colliders are briefly discussed. As a somewhat alternative use 
of resummation techniques, jet production in electron—positron annihilations and in 
hadronic collisions is also discussed and, especially in the latter, some characteristic 
patterns are developed. 

The next chapter, Chapter 3, is fairly technical, as it comprises a presentation of 
most of the sometimes fairly sophisticated technology that is being used in order to 
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evaluate cross-section at leading and next-to leading order in the perturbative expan- 
sion of QCD. It also includes a brief discussion of emerging techniques for even higher 
order corrections in QCD. In addition, the interplay between QCD and electroweak 
corrections is touched upon in this chapter. Starting with a discussion of generic fea- 
tures, such as a meaningful definition of perturbative orders for various calculations, 
the corresponding technology is introduced, representing the current state of the art. 
As simple illustrative examples for the methods employed in such calculations, again 
inclusive W boson production and its production in association with a jet are em- 
ployed. The calculations are worked out in some detail at both leading and next-to 
leading order in the perturbative expansion in the strong coupling. 

The overall picture and phenomena encountered in hadron—hadron collisions, de- 
veloped in Chapter 2, is discussed in the context of specific processes in Chapter 4. 
The processes discussed here range from the commonplace (e.g. jet production) to 
some of the most rare (e.g. production of Higgs bosons). In each case the underlying 
theoretical description of the process is described, typically at next-to leading order 
precision. Special emphasis is placed on highlighting phenomenologically relevant ob- 
servables and issues that arise in the theoretical calculations. The chapter closes with 
a summary of what is achievable with current technology and an outlook of what may 
become important and relevant in the future lifetime of the LHC experiments. 

Following the logic outlined in Chapter 2, in Chapter 5 the discussion of fixed-order 
technology is extended to the resummation of dominant terms, connected to large log- 
arithms, to all orders. After reviewing in more detail standard analytic resummation 
techniques, and discussing their systematic improvement to greater precision by the 
inclusion of higher-order terms, the connection to other schemes is highlighted. In the 
second part of this chapter, numerical resummation as encoded in parton showers is dis- 
cussed in some detail. The physical picture underlying their construction is introduced, 
some straightforward improvements by introducing some generic high-order terms are 
presented and different implementations are discussed. Since the parton showers are 
at the heart of modern event simulation, bridging the gap between fixed-order per- 
turbation theory at high scales and phenomenological models for hadronization and 
the like at low scales, their improvement has been in the focus of actual research in 
the past decade. Therefore, some space is devoted to the discussion of how the simple 
parton shower picture is systematically augmented with fixed-order precision from the 
corresponding matrix elements in several schemes. 

In Chapter 6, an important ingredient for the success of the factorization formal- 
ism underlying the perturbative results in the previous two chapters is discussed in 
more detail, namely the parton distribution functions. Having briefly introduced them, 
mostly at leading order, in Chapter 2, and presented some simple properties, in this 
chapter the focus shifts on their scaling behaviour at various orders and how this can 
be employed to extract them from experimental data. Various collaborations perform 
such fits with slightly different methodologies and slightly different biases in how data 
are selected and treated, leading to a variety of different resulting parton distribu- 
tions. They are compared for some standard candles in this chapter as well, with a 
special emphasis on how the intrinsic uncertainties in experimental data and the more 
theoretical fitting procedure translates into systematic errors. 
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The tour of ingredients for a complete picture of hadronic interactions terminates in 
Chapter 7, where different non-perturbative aspects are discussed. Most of the ideas to 
address them are fairly qualitative and can be embedded in phenomenological models 
only. Therefore, rather than presenting in detail all developments in this field, the 
book focuses more on generic features and basic strategies underlying their treatment 
in different contexts. Issues discussed there include hadronization, the transition from 
the partons of perturbation theory of the strong interaction, quarks and gluons, to the 
experimentally observable hadrons, and their decays into stable ones, the underlying 
event, which is due to softer further interactions between the hadronic structures of 
the incident particles, and its connection to very inclusive observables such as total 
and elastic cross-sections. 

In Chapters 8 and 9 theoretical results from analytic calculations and simulation 
tools are compared with a host of experimental data. Chapter 8 focuses on data es- 
pecially from the TEVATRON,’ where the foundations of our current understanding of 
the SM and in particular the dynamics of the strong interaction have been shaped. 
In Chapter 9 the most sophisticated calculations and simulations are compared with 
the most recent, most precise and most challenging data so far, taken at the LHC 
during Run I. This comparison ranges from inclusive particle production over event 
shape observables to data testing the dynamics of the SM — and potentially beyond 
— over scales ranging over two order of magnitude in the same process. This is the 
most challenging test of our understanding of nature at its most fundamental level 
ever performed. It is fair to state that while our most up-to-date tools, analytical cal- 
culations and simulations fare amazingly well in this comparison, some first cracks are 
showing that will motivate the community to push even further in the years to come. 


1.2.2 A user’s guide 


This book is meant to provide PhD students in experimental particle physics working 
at the LHC who have a keen interest in theoretical issues, as well as PhD students 
working in particle theory with an emphasis on phenomenology at colliders, a starting 
point for their research. It is meant to introduce and expose the reader to all relevant 
concepts in current collider phenomenology, introduce and explain the technology that 
by now is routinely used in the perturbative treatment of the strong interaction, and 
provide an integrated perspective on the results of such calculations and simulations 
and the corresponding data. 

The book consists of three parts. The first part is an overview of the relevant 
terminology and technology, worked out through one standard example and providing 
a coherent perspective on hadronic interactions at high energies. Readers and teachers, 
using this book for lectures, are invited to study Chapter 2 first before embarking on 
a more in-depth discussion of various theoretical or experimental aspects. The other 
two parts consist of a more detailed discussion of various aspects of the perturbative 
treatment of the strong interaction in hadronic reactions in the second part of the book, 
in Chapters 3-7. While these chapters frequently refer back to the overview chapter, 
Chapter 2, they are fairly independent from each other and could in principle be used in 


3Experiences from LEP and HERA have also been important but are not included due to space 
limitations. 
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any sequence the reader or teacher finds most beneficial. The third part, Chapters 8 and 
9, where core experimental findings are confronted with theoretical predictions, again 
is independent of the second part, although for a better understanding of theoretical 
subtleties it may be advantageous to be acquainted with certain aspects there. 

Finally, a list of updates, clarifications and corrections to this book is maintained 
at the following website: 


http://www.ippp.dur.ac.uk/BlackBook 


2 
Hard Scattering Formalism 


Before embarking, in this chapter, on a first discussion of the factorization formula 
and some of its immediate consequences in terms of actual phenomena and calcula- 
tions, in Section 2.1 an intuitive picture of high-energy reactions involving hadrons in 
the initial state will be developed. This picture in fact forms the physical background 
of the factorization formalism, which in turn provides the theoretical foundations of 
this book. 

In the next section, Section 2.2, the ideas formulated in the previous section will 
be further formalized and condensed into a discussion of the perturbative treatment of 
high-energy reactions at hadron colliders at fixed order. To illustrate the factorization 
formalism in action, the case of inclusive W-boson production will be analysed at 
leading and at next-to-leading order. 

In Section 2.3 the perturbative formalism developed so far will be further expanded 
to include the most important effects to all perturbative orders. This is achieved 
through the identification and subsequent resummation of the corresponding leading 
terms. Again, the case of W boson production serves as the main illustrative example. 

Finally, in Section 2.4, some general thoughts and issues related to the description 
of hard processes through perturbation theory will be deepened. 


2.1 Physical picture of hadronic interactions 
2.1.1 Electromagnetic analogy 


To illustrate the physical picture underlying factorization in processes initiated by col- 
liding strongly interacting particles such as protons at high energies, the simpler case 
of collisions with leptons in the initial state is considered first. In this, an intuitive 
understanding of the theory of strong interactions, QCD, quantum chromodynamics, 
will be developed in close analogy with the simpler theory of QED, quantum electro- 
dynamics. In both cases, although ultimately very different, the colliding particles will 
emit secondary quanta in an intricate radiation pattern, which must be taken into 
account to gain full theoretical control over the collision dynamics. 

Experimentally, for example in collisions of electron—positron pairs, it is of course 
typically not difficult to require that the energy in the actual collision is close to the 
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centre-of-mass energy of the colliding beams, thus effectively reducing the amount 
of energy carried away from the leptons through electromagnetic radiation. However, 
most of the time, especially when their combined initial invariant mass is above the 
mass of a resonance, such as the Z boson, the leptons will react with actual energies 
that are reduced with respect to their full available energy. This effect is sometimes 
called “radiative return” The corresponding energy loss is due to the emission of 
photons from the incident leptons, a process denoted as QED initial-state radiation 
(ISR). . 

While in QED the treatment of ISR potentially is a tedious but essentially straight- 
forward exercise, tractable with perturbation theory, in QCD the problem is much more 
involved and fundamentally different. This is because in QCD, the colliding particles 
cannot be interpreted as the fundamental quanta of the theory but rather as bound 
states, hadrons such as protons, which cannot be quantitatively understood and de- 
scribed through the language of perturbation theory. This conceptual gap necessitates 
the construction of a framework to provide direct and systematically improvable con- 
tact between the proven language of perturbative calculations of corresponding cross- 
sections and the non-perturbative structure of the hadronic bound states. This section 
is devoted to developing an intuitive picture of how this factorization framework 
actually works, by dwelling on the limited analogy with the emissions of secondary 
quanta in QED and the differences between electromagnetic and strong interactions 
when considering such collisions in greater detail. 


2.1.1.1 Equivalent quanta 


Classically, the phenomenon of initial-state radiation can be understood with the 
equivalent photon picture (523, 884, 892]. In its own rest frame, the lepton acts as 
the point-like source of a purely radial electric field, with no magnetic field present. 
This situation is depicted in the left panel of Fig. 2.1. Boosting the lepton into any 
frame where it is not at rest and, in particular, into the laboratory frame transforms 
the static source of an electric field into an electromagnetic current, which in turn 
produces also a magnetic field. With increasing lepton velocity, v — c, the two fields 
become increasingly confined to a plane perpendicular to the axis of motion and they 
also become more and more perpendicular to each other, with the electrical field ra- 
dial and the magnetic field circular around the axis of motion, cf. the middle panel of 
Fig. 2.1. This orthogonality allows the identification, in a straightforward way, of this 
classical configuration with quanta of the electromagnetic interaction, the photons, 
shown in the right panel of Fig. 2.1. The accumulated energy flux — that is, the total 
energy carried by the fields accompanying the lepton — is obtained by integrating the 
Poynting vector over the plane orthogonal to the lepton’s axis of motion. It is thus 
parallel to the lepton’s motion. Interpreting the continuous energy flux with a flux 
of equivalent quanta, the photons, the number density of accompanying photons per 
energy interval and distance from the lepton can be deduced as 


e, (2.1) 
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v=0 v=c vēžc 
Fig. 2.1 Electrical and magnetic fields in blue and green of a lepton at 
rest (v = 0, left panel) and with a velocity v ~ c (middle panel). The 
equivalent photons are depicted on the right panel. 


Here w is the energy of the equivalent photon, and the constant of proportionality 
is obtained by integrating over the transverse plane, parameterized by the impact 
factor b,, and by the electromagnetic coupling constant. The latter gets modified by 
the relative charge of the electron, ee, which of course equals -1. There is a maximal 
energy available for these equivalent photons; naively it is given by wmax = Ẹ, the 
energy of the lepton. 

In a more quantum field-theoretical way of thinking about this, the impact pa- 
rameter is replaced with the transverse momentum, b, 4+— kı, through a Fourier 
transform, and the equivalent photons are considered to be part of the lepton’s wave 
function. Their spectrum then reads 


a dw dk? 
di =. =. 22 
They m w k? ( ) 
In such a picture, the physical lepton is given by a superposition of states with varying 
photon multiplicity, 


le)phys = le) + ley) + levy) +... (2.3) 


where the photons have different energies and transverse momenta. Due to momentum 
conservation, they are typically off their mass shell. This limits the lifetime of the 
quantum-fluctuations like |ey) or |eyy). 

To see how this works in somewhat more detail, consider the case of the |ey) Fock 
state. Assuming massless on-shell electrons in the initial and final state, but allowing 
the photons to go off-shell, the kinematics of a splitting e(P) — e(p) + y(k) can be 
written as 

P=p+k —>0=2p-k+k’. (2.4) 
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Parameterizing the splitting of the energy, E of P such that w = E, = zE and 
therefore Ee = (1 — «)F for the energy of the outgoing electron, leads to 


k? ~ k? = k? (2.5) 


in the case where x is small, i.e., for the bulk of the photons. This also implies that 
in this limit the momentum component of the photon parallel to the electron is about 
kj ~ w, the energy of the photon. The fact that the emitted photons (or the electrons 
or both) are off-shell is equally true for massive electrons, and it limits the lifetime of 
the (ey)-component of the wave function through the uncertainty principle. With the 
photon momentum given by k” ~ (w,k,,w), the energy shift necessary to move it on 
its mass shell is given by dw ~ k? /2w yielding 
1 2w 2gE 


jw Ok vm) 


Ty os. 


as the lifetime of the fluctuation. This implies that such fluctuations live longer, 
as the energy of the photon increases and the relative transverse momentum of the 
photon and the electron decrease. Note that in this book natural units are being used, 
so effectively h = c= 1. 

In addition, of course, the photons can split into a fermion—anti-fermion pair, as yet 
another quantum fluctuation, which in turn may emit further photons. This compli- 
cates the wave-function picture even more. However, altogether, the distance from the 
original lepton and the lifetime of all these virtual particles, the quantum fluctuations 
which they manifest, are given by the amount they are off their mass shell and by the 
amount of energy they carry. 


2.1.1.2 Dokshitser—Gribov—Lipatov—Altarelli—Parisi equations 


The wave-function idea can be related, at leading order, by probabilities of finding 
the original lepton, the photons, or secondary leptons at given energies and transverse 
momenta. Due to the nature of the electromagnetic interaction, these probabilities 
(or, the lepton wave function), can be calculated from first principles. In the following 
the probability of finding a lepton or photon at energy fraction x (with respect to 
the original lepton) and at transverse momentum k will be denoted as (x, kų) and 
q(x, kı), respectively. At leading order in a, that is without any emission of secondary 
quanta, they are of course given by 


L(x, kå =0)=d6(1—2) and (a, ki =0) =0. (2.7) 


The radiation that actually gives rise to the aforementioned secondary particles can 
now be described, to all orders, by equations that are known as evolution equations, 
since they relate the probability to find quanta with certain kinematics x and k? to 
similar probabilities at other scales, through emission of secondaries. In general, they 
can be obtained in different approximations related to different kinematical situations, 
which can be intimately related to different factorization schemes. In the collinear 
factorization scheme, which will be employed in most of the remainder of this 


16 Hard Scattering Formalism 


book, the energy and especially transverse momentum of the secondary particles are 
considered to be small with respect to the energy of the original lepton, and therefore 
the kinematical effect merely amounts to a successive reduction of the original lepton 
energy. In this scheme, the probability densities evolve with the logarithm of the 
transverse momentum, cf. Eq. (2.2), as 


d(x, k?) _ alk?) fdg 
dlog k? Qn iS 


Pu (Z, olf) 6, #2) 
1 

dy(x, k? k2 d 

= ED E o Py E a(k2)) L(E, kî), 


where the effect of photons splitting into virtual lepton-anti-lepton pairs has been 
omitted. The first line of these equations exhibit how the probability density of leptons 
with smaller energy fraction x is driven by leptons with a larger energy fraction 2/€ > 
x, which can be interpreted as the leptons on the right hand side of the equation losing 
an energy fraction 1 —€ in the emission of a photon. In the kinematical approximation 
employed here, terms like Pee(x/E, a), the splitting kernels, can be understood as 
reduced matrix elements for the emission of one photon off the lepton. In general, Ppa 
denotes such kernels for a transition of a particle of type a to a particle of type b, while 
emitting a particle of type c, which is not been made explicit. These kernels have been 
taken at leading order, as manifest by the explicit order in a in front, but of course 
they can also be evaluated to higher orders in a perturbative expansion in powers of 
the coupling with corresponding coefficient functions. 

Closer inspection reveals that this set of equations is nothing but the celebrated 
Dokshitser—Gribov—Lipatov—Altarelli—Parisi (DGLAP) equation, specified for 
QED. In this equation, the splitting kernels at leading order are independent of the 
coupling constant and read 


ae + Saq >| 
Haa] 


z 


Pre (2) =< 


Here, the notation of “+”—functions has been employed, which will crop up again 
at various places, especially in conjunction with splitting kernels such as the ones 
discussed here. They are defined through their integral together with a test function 
g(z) such that 
1 1 
S aO = f dete) Io) - 901 (2.10) 
0 0 
For further details, the reader is refered to Appendix A.1.2. To gain a more intuitive 
understanding, consider this prescription to work in such a way that the pole for 
z — 1 in Po is excluded in the +-function and reinserted through the second part of 
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the splitting kernel, proportional to the 6-function. It will be seen later, how such terms 
become necessary to ensure the correct physical behaviour of the splitting functions, 
such as satisfying momentum conservation. 

However, these equations will be revisited and discussed in more detail in later 
chapters of the book. To follow the reasoning in this more introductory chapter it 
should suffice to mention that the 1/w pole present in the naive classical picture has 
its counterpart in the 1/(1 — z) or 1/z terms appearing in the splitting function. 


2.1.1.3 Initial-state radiation 


Turning away from the details of how the lepton wave function is evaluated and back 
to the physical consequences of the existence of the fluctuations, the obvious question 
is: what happens in a collision? Can the emerging picture be put to stringent tests 
and can the existence of such fluctuations be quantified by suitable probes? 

To answer this, at least qualitatively, and to gain some insight into the physical 
processes taking place in a collision, consider first the case of no collision at all. There, 
the fluctuations have a finite lifetime and distance to the original lepton, related to 
the kinematics of the virtual particles. Four-momentum conservation thus forces these 
fluctuations to eventually collapse back into the original lepton, guaranteeing the quan- 
tum coherence of the lepton — it must remain intact. If, on the other hand, a collision 
takes place, one or more of the quantum excitations, the particles forming the lepton’s 
Fock state, may, through the exchange of four-momentum with the incident projectile, 
go on their mass shell. In this way they acquire an infinite lifetime. In such a situation, 
the corresponding particles will not fall back onto the original lepton and the coher- 
ence of the fluctuations is therefore broken. In this way, one or more of the hitherto 
virtual particles may become real and hence physically observable in the final state of 
the collision. 

To illustrate this, consider the case of et e~ collisions, which have been investigated 
in great detail for example at LEP, and assume that it is the electron and positron that 
take part in the collision. Then the centre-of-mass energy Eom. of this actual reaction 
may be reduced with respect to the energy obtained from the nominal beam energies, 
Eom. < Eom. = 2Ebeam.' The difference is stored in the energies of the photons 
accompanying the electron—positron pair; their distribution in transverse momenta 
and energies is given approximately by Eq. (2.2). In more modern language these 
photons would be attributed to initial-state radiation off the incident “original” 
pair, rather than to the quantum manifestation of the breakdown of coherence of 
the Fock states describing the leptons and their accompanying electromagnetic fields. 
Nevertheless, this radiation would, of course, be described by equations very similar 
to the approximate one. 

It should be stressed, however, that, taken on its own, this initial-state radiation is 
a manifestation of the breakdown of quantum coherence of complicated Fock states, 
binding the photons to the original leptons. 


lHere, and throughout the book, kinematical quantities q related to the colliding partons are 
supplemented with a g, while those related to the (beam-)particles are left without it as q. 
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2.1.1.4 Final-state radiation 


A similar picture also emerges when charged particles are produced in the final state. 
As an example, consider the case of, say, muon pair production in electron-positron 
annihilations, ete~ — pt p-. Classically, the production of the muons can be under- 
stood as their instantaneous acceleration after emerging from a finite energy density 
related to the previous annihilation of the electron and positron into electromagnetic 
fields, the intermediate photon in quantum field theory. In classical electrodynamics 
such an acceleration of charged particles triggers the radiation of additional photons 
off the charges, known as Bremsstrahlung.? Interpreting the muon pair as a cur- 
rent going from a velocity V’ to v at the origin of the coordinate system, the double 
differential classical radiation spectrum J in the direction 7#(Q) with polarization € 


reads 
ay v v 
— € . 
dwdQ An? l-v-n 1-Wv-n 


in the dominant region of small energies, where the squared term is known as the 
radiation function W [645]. For massless particles travelling at the speed of light, 
the radiation function can be rewritten as 


2 


aI 2 
(2.11) 


1 — cos hyu’ 
(1 — cos nv) (1 — cos Any) ` 


Weg = (2.12) 


Cast in its covariant form and interpreted as a photon spectrum and tacitly inserting 
a = e?/ (4r), the radiation spectrum becomes 


iy sae A 
ie c rk pe z) 
for the number of photons N emitted in the process. Here, k and e denote the photon’s 


four-momentum and polarization four-vector, while p and p’ denote the four-momenta 
of the muons. The form above, 


2 dk 
(27)82ko 


ine 


(2.13) 


T 


W(p, p'; k eae ae 2.14 
(p, p’; ,€) = E ¥-#) (2. ) 
is also known as the eikonal form, and it is identical with the result of a full-fledged 
calculation in quantum field theory, as follows. 

For the case at hand, consider the photon emission part off the muons, which are 
assumed to be massless. The muons are produced through a vertex factor called r. The 
leading-order Feynman diagrams are depicted in Fig. 2.2. The corresponding matrix 
element for X > u~ (p)u*(—p’)y(k) is given by 


Mx spty-y = eti,- (p) Y (p + k)? (p' — k) 


In the quantum picture, equivalently, this translates into the radiation of Bremsstrahlung photons. 
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Fig. 2.2 Feynman diagrams for the emission of one photon by a muon 
pair at the lowest perturbative order. 
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(2.15) 
where the equations of motion 
pulp’) = u(p)p=0 and p’ =p° =k’ =0 (2.16) 


for massless particles as well as the anti-commutator relation {y“, y”} = 2g” have 
been used. Note that the only effect of the muon spin is the occurrence of emissions 
through magnetic terms, 4 [7", #], while the terms 2p” — k” do not bear any memory 
of the muons having spin. Going to the limit of soft photon emission, terms « k in the 
numerator vanish, resulting in 


P pe o | o y 
MXutu-y = ee, (k) A _ ma Up- (p )Puy+ (p) 
= eW(p, p'; k, €) Mxsptu-- 


(2.17) 


In other words, in the limit of soft photon emission, the emission term completely 
factorizes from the production of the system emitting the photon and is just given by 
the eikonal term W(p, p'; k, €). 

It is worth noting that soft photon emission is independent from the spin of the 
emitting particles and other internal properties and by its very nature it is a classical 
phenomenon. This is the essence of Low’s theorem [734]. As a consequence, soft 
photons can be thought of as essentially not carrying any quantum numbers, and, 
therefore, the emission of soft photons will not lift any veto — for example, due to 
C-parity or angular momentum — for a transition to happen. Such transitions, in the 
case at hand, for instance, from one helicity to another, necessitate the emission of 
a hard photon, described by the terms œ k. In contrast to the soft, classical terms, 
these essentially quantum-mechanical contributions do not exhibit any soft divergence 
dw/w but rather behave like dw: w. 

The simple classical picture of soft photon emission in principle allows a description 
of the pattern of photon radiation from the muon pair, by iterating emissions through 
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the eikonal term. This implicitly assumes that the individual emissions are indepen- 
dent from one other. The key point here is the introduction of the notion of photon 
resolution. The reason for this is the observation that the eikonal factor diverges for 
k — 0, the soft divergence, or for k parallel to p or p’, the collinear divergence. This 
is nothing but the well-known infrared-catastrophe of QED, a pattern that occurs for 
every theory with massless spin-1 bosons such as QED or QCD. Essentially it can be 
explained by the fact that it makes no physical sense to ask how many photons are 
emitted by a particle, without specifying how the photons are measured. In practice, 
photons can be too soft for a detector to respond; at the same time, if they are too 
parallel with the emitting particle they will end up in the same detector cell, which 
must have a finite size, and thus will not be distinguished. Therefore, the phase space 
of the photons must be constrained to detectable photons to make any sense. This 
is achieved by appropriate cuts, for instance by demanding a minimal energy and a 
minimal angle with respect to the emitting particle. 

There are now in principle two ways of how the full photon radiation pattern can 
be described: in a more direct approach, the integral of the eikonal factor over the 
constrained photon phase space could be used as the relevant term in a Poissonian 
distribution of the number of photons being emitted, the independence of the indi- 
vidual emissions guaranteeing the Poissonian character of this distribution. For each 
photon then, the phase space could be individually fixed. Alternatively, the emissions 
could be ordered in, say, the energy of the emitted photons, or, as a somewhat pre- 
ferred choice, the relative transverse momentum with respect to the emitter. This 
would allow a redefinition of the radiation pattern through a probabilistic picture, 
driven by the Sudakov form factor. In order to see how this works in more detail, 
cf. Section 2.3 and Chapter 5. 

In any case, it is worth noting that the eikonal picture here, condensed in Eq. (2.17) 
can be directly translated to the equivalent quanta picture above and the slightly more 
sophisticated version encoded in the DGLAP evolution equations, Eq. (2.8). 


2.1.2 Bound states, strong interactions and a, 


This simple, qualitative picture will now be extended to the case of incident hadrons 
- for simplicity they will be assumed to be (anti-)protons. Naively, they consist of 
three valence quarks, which carry the quantum numbers of the proton — electrical 
charge, spin, and isospin. In a naive picture, where the quarks just stick together 
with no discernible interactions responsible for it, it would be a fair guess to assume 
that they are all equally important and thus all carry the same energy fraction of the 
proton, namely 1/3. 

Switching on naive interactions, acting like rubber bands gluing the valence quarks 
together, one could assume that the original sharp distribution is washed out - but 
that the energy fraction distribution of the valence quarks still has a mean value of 1/3. 
Of course, in view of the previous discussion of the QED case, and because the QCD 
coupling constant is much larger than the QED one, this simplistic picture cannot 
hold true. In fact, taking into account quantum corrections to the strong coupling 
constant, a,, which are mediated by loop diagrams depicted in Fig. 2.3, the picture is 
a bit more complicated. 
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Fig. 2.3 The leading quantum corrections to the running of the QCD 
coupling constant as: gluon self energy at one loop order (ghost diagrams 
are ignored). 


2.1.2.1 The running of as 


Including such corrections, and denoting all couplings — QED and QCD — collectively 
with a = g?/(47) they vary with the scale ug, also known as (renormalization) 


scale, as 
2 Oa 
uh “SE — Bla), (2.18) 


where the -function 8(a) can be expanded in a perturbative series and reads 


a? 4 By 3 
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Here the customary relation a, = g2/(4m), and, similarly, a = e?/(47) has been 
assumed. 

In SU(N) gauge theory, the first coefficients 3; of the perturbative expansion of 
the 6-function read 
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where the number of active fermions is given by nf, and where the Casimir operators 
of the gauge group in its fundamental and adjoint representations are Cp and C4. 
It is worth stressing here that the first two coefficients, 59 and 6, are renormalization 
scheme-—independent, while all further contributions, starting with G2, depend on the 
renormalization scheme; the result given here for 2 is the one in the MS scheme. 
For a very brief review, cf. Appendix B.1. 

In QCD, they are given by 


and C4 =C,=N. (2.21) 
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and represent the colour charges of the particles. In addition, 


Tr = (2.22) 


1 
2 
has been used. 

To first order accuracy, and with the help of a reference scale Q, the solution of 
Eq. (2.18) can be written as 


2/2 2 

_ J (UR) a(Q*) 
(ue) = Re = Lor. (2.23) 

1+ a(Q?) Tr 108 Gt 
For the case of QCD, it has become customary to write the solution as 
2/2 
_ 9s (HR) 1 
as(uh) = ER = = (2.24) 
T Bo | HR 
An og A2 
QCD 


Agcp % 250MeV has been introduced as the QCD scale. Conversely, as could be 
fixed experimentally, for instance through a,(m?,) ~ 0.118 [799], taken from the PDG. 


The difference with respect to QED, at this level, can be traced back to the differ- 
ence in the 6-function, which for QED to first order reads 


Qn 


Bo = 3 (2.25) 


This difference in the overall sign, stemming from the gluon loops which of course 
are absent in the QED case, implies that, in striking contrast to the electromagnetic 
coupling, the strong coupling increases with increasing distance or decreasing scale and 
decreases with decreasing distance or increasing scale, a property known as asymp- 
totic freedom. In physical terms this phenomenon can be interpreted as the response 
of the vacuum to the presence of an electromagnetic or strong charge. In quantum field 
theory, the vacuum is not really an empty space, but it consists of a superposition of 
short-lived quantum fluctuations with total quantum numbers 0. In the case of QED, 
such fluctuations consist, to leading perturbative order, of fermion—anti-fermion pairs. 
In the presence of a fixed charged particle, say an electron, these pairs will not be 
completely unpolarized. Instead they will orient themselves in such a way that the 
positively charged fermions are closer to the electron, and the negatively charged vir- 
tual particles are further away. Trying to measure the charge of the electron by probing 
it with a photon thus introduces a scale dependence: a low-Q? photon will only be able 
to “resolve” a comparably large cloud of charged virtual particles, partially screening 
the charge of the electron. This can be seen as a close analogy to a charge in a dielectric 
material in classical electrodynamics, where the internal polarization fields partially 
counteract the electric field related to the charge. Increasing the virtual mass Q? of 
the photon will allow it to resolve smaller distances and thus probe the electron with 
diminished influence of the screening cloud. The same mechanism, of course, also ac- 
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counts for the case of QCD. Taking here a quark as a fixed charge, by far and large, 
the virtual quark—anti-quark pairs will orient themselves in such a way that the anti- 
quarks are closer to the quark. Similar to the case of QED, this effectively will lead to 
a screening of the fixed charge inside the cloud of short-lived virtual charged particles. 
However, in contrast to QED, also the carriers of the interaction, the gluons, carry 
colour charges and thus contribute to screening effects. Their contribution comes with 
a sign opposite to the fermionic contribution, in classical electrodynamics such a be- 
haviour would be classified with the pathological case of a dielectric constant smaller 
than unity. 

The effect of this running coupling of the strong interaction has been con- 
firmed experimentally, as shown in the pictorial summary of theory and experimental 
data in Fig. 2.4. For small scales, i.e. for ur —> Agcp, the running coupling diverges, 
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Fig. 2.4 The running of the strong coupling constant: experimental data 


confront theory, with the theoretical uncertainty shown as the blue band 
(Reprinted with permission from PDG [229]). 


signalling the breakdown of perturbation theory and any ideas of treating quarks as 
quasi-free particles. This is also known as the Landau pole of QCD. 


2.1.2.2 Equivalent quanta (QCD) 


Applied to the situation encountered in hadrons this means that there is a proliferation 
of particles, gluons emitted by the quarks and in turn splitting into gluon or quark- 
anti-quark pairs, at small transverse momenta, i.e., scales of the order of a few times 
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Acn: This radiation of secondary particles is even stronger than in electrodynamics: 
the 1/k? -term present in both QED and QCD, is further enhanced by multiplying it 
with the strongly increasing as(k? ) rather than with agen (k? ). In addition, since the 
gluons carry a colour charge, they also emit secondary gluons, such that the simple 
picture of QED Bremsstrahlung off an electron as introduced in Eq. (2.2) adapted 
for QCD becomes 


as(k?) dw dk? 


dn?9 = Cyg: 
g ae T w ke * 


(2.26) 


where the colour factors Cy,, = Cra again are the Casimir operators of the 
fundamental (Cr) and adjoint (C4) representation of the gauge group underlying 
QCD, SU(3). From the relevant equation, Eq. (2.21), it can be seen that in the large- 
N. limit, Ne — co, the colour charge of the quark becomes N,/2, half as large as 
the gluon colour charge, reflecting the fact that gluons carry two colour indices, while 
quarks carry only one. They are therefore, in this limit, twice as likely to emit a gluon 
than the quarks. Note here that it is quite often useful to consider the large-N, limit, 
as corrections to this limit are typically suppressed by 1/N2. Parametrically this is a 
correction of the order of 10%, but often, and in particular for sufficiently inclusive 
quantities, the effects are significantly smaller and can thus be neglected. 

It should be stressed, though, that due to the Landau pole the perturbative lan- 
guage must fail at transverse momenta around or below the Landau pole, with the 
result that the bound-state structure of the hadrons is more complicated than three 
quarks somehow bound together. Coming back to the qualitative picture, where hard 
interactions with constituents of the complex Fock state describing an incident fermion 
break its coherence, it is clear that something similar will also occur in the case of 
QCD. There are, however, a number of differences to this simple picture. First of all, 
the incident particle will be a hadron, a bound state in itself, with a difficult structure 
that cannot be calculated from first principles. Ignoring this complication, assume that 
at scales sufficiently high above the Landau pole, where a perturbative description be- 
comes sensible, the hadron is made of a number of valence partons. Apart from some 
colour exchange binding them together, these objects will start emitting softer quanta, 
in analogy to the Fock state picture in QED, which are also known as sea partons. The 
coherence of the hadrons enforces the recombination of these secondary quanta, and 
a picture like the one depicted in the left panel of Fig. 2.5 emerges. In the right panel 
of the same figure the effect of a hard probe, a photon interacting with the partonic 
ensemble is shown: one of the partons — the one the photon interacted with — is 
“kicked” out of the hadron. As a consequence the colour field is missing a quantum 
number, and the remaining partons cannot recombine anymore. The coherence of the 
hadron breaks down and a more complex final state emerges through a reconfiguration 
of the coloured partons. 


2.1.3 Hadrons in the initial state: The full picture 


Starting from a naive constituent picture (three quarks for a nucleon, like wud for 
a proton, udd for a neutron, or a quark—anti-quark pair for a meson, for example, 
ud/td for x), multiple emissions of secondary sea partons off these primary valence 
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Fig. 2.5 Hard interactions break coherence of QCD initial states, In the 


left panel, coherence of the stable hadrons enforces the recombination of 
emitted partons, in the right panel the interaction with a hard photon 
destroys this coherence by “kicking” out the parton it couples to. 


partons will occur, again governed approximately by Eq. (2.26). Similar to the case 
of QED, also in QCD, the lifetime of the sea partons is proportional to w/ k? and 
their distance from the emitting valence quark is given by 1/k,. The coherence of 
the quantum states, encoded as negative virtual mass, also guarantees, like in QED, 
that no partons can evade their hadron. In a hard collision, again, this coherence 
may break down if a parton is “kicked out” of the radiation pattern, and the partons 
that stay behind may go on-shell, thus becoming physically meaningful objects with 
observable consequences. Of course, also in QCD, the more modern way to look at this 
employs the notion of initial-state radiation, described by evolution equations such as 
the simplified ones encountered already in the naive discussion of QED, see Eq. (2.8). 


2.1.3.1 Deep-inelastic scattering 


In order to discuss this in more detail, consider the case of deep-inelastic scattering 
(DIS), where an electron interacts with a proton by exchanging a highly virtual 
photon. The process is called “deep” for the absolute value of the virtual mass of the 
photon, Q, being much larger than the proton mass mp or its inverse radius, Rp, 
which is of about the same size: Q >> mp ~ 1/Rp, and therefore probing its deep 
internal structure. It is “inelastic” if the electron loses enough energy for the proton 
to break up. To further analyse the process, typically the Breit-frame is employed; 
in this Lorentz frame the proton moves with large velocity, and the virtual photon has 
no energy. In other words, their respective momenta P and q are given by 


P! =(Py,0,P.) with P, = \/ P2 — m3 x P, 
o ae T aie (2.27) 
R 2 
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q” = xg P,(0, 0,0,2) p” = £tgP,(1,0,0, 1) 
p” = xzgP,(1,0, 0, —1) 


Fig. 2.6 Sketch of the photon-parton interaction in the Breit-frame. The 
parton only changes its momentum in the collision — its z-component 
effectively is reflected. 


Introducing the Bjorken variable xp as 


2 2 
q q 
e — Z 2.28 
"= Peg 2P.q: ve 


implies that q, = 2x g P,. A sketch of the Breit-frame in these parameters is depicted 
in Fig. 2.6. The time during which the interaction between photon and proton takes 
place is given by the (longitudinal) wavelength of the photon, which of course is the 
component being hit by the proton. Hence, 


1 
Tint ™ Àz NE (2.29) 


qz 

In order for the photon to see the partons inside the hadron, they must have wave- 
lengths at least as large as the photon, which is purely longitudinal; at the same time 
for the partons to see the photon, the photon’s wavelength must at least be as large 
as theirs. Therefore, their respective longitudinal wavelengths must be the same and, 
consequently, also their momenta must be of the same size: p, = qz. In the approxi- 
mation of quasi-collinear partons then the momentum-fraction x of the struck parton 
with respect to the incident proton must be of about the same order as xg. The life- 
time of these partons, as discussed in Eq. (2.6), is given by Tparton ~ pz/p?., which is 
larger than the interaction time Ting ~ 1/pz, provided that p; > p1. 

To summarize the discussion up to now: the parton picture of interactions devel- 
oped so far is a sensible construction. Demanding that the interaction time of the 
parton with the photon is much smaller than the lifetime of the quantum fluctuation 
that actually is the parton, yields a condition on the parton kinematics: the trans- 
verse momentum of the parton must be much smaller than its longitudinal momentum, 
p2 > p’.. In this setup the point-like photons measure the number of partons with 
similar longitudinal momentum in an area of size 1/Q?, or phrased in slightly dif- 
ferent terms, they probe partons with momentum fraction x % xg at scale Q?. The 
cross-section of this process must then be proportional to the sum of probabilities to 
find partons with this kinematics, that can interact with the photon, i.e., ignoring 
higher-order diagrams: 


Oep i Sea Oy, (2.30) 
q 


if the photon that is responsible for the interaction has a (negative) virtual mass 
squared with absolute value Q? and a longitudinal momentum given by z. 
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Furthermore, if this picture holds true — if p? >> pł — then the photon-parton 
collision can be treated as the collision of two independent quasi-free particles; the 
struck parton does not feel the presence of the surrounding partons forming the pro- 
ton, since the interaction time is smaller than the lifetime of this parton, i.e., the 
characteristic time after which the strong colour fields force the parton to recombine 
with the other partons. In such a framework the probabilities to find partons with 
a given momentum fraction x at a scale Q? inside a proton is process-independent, 
so the DIS setup with exchanged photons can be replaced by other processes with 
similar kinematics, such as Drell-Yan production, jet production, etc.. Thus, encoding 
these probabilities of finding a parton p in a hadron h as process-independent parton 
distribution functions (PDFs) f,/;,(«, Q?) is sensible. This is the physical basis of 
the factorization formula presented in the next section. 

The PDFs cannot, at the moment, be calculated from first principles, since they are 
truly non-perturbative objects. Of course, assuming the validity of the factorization 
picture, they can be measured in different processes and at different scales. These can 
be related to each other through the DGLAP equations, cf. Eq. (2.8) for the case of 
QED, which for QCD read, 


o Ce an) 
alog Q? \ farls, Q’) 
1 
_ as(Q?) I dz a Pag (3) | ae a) 
an z \ Poq Pog (F fank, Oy? 
where the sum over different 2ns quark flavours and their anti-flavours is implicit and 


will be further detailed in Chapter 6. Schematically, the convolution in Eq. (2.31) could 
be written as 


monet E) =a (pepe) © (RNG) e 


The kernels of the evolution equation, the splitting functions, are given, again, at 
leading order,® by 


(2.31) 


NIBNIB 


3For further reference, the splitting functions are decomposed here into a splitting kernel P and 
the anomalous dimensions of quarks or gluons, Yq,g, where applicable. 
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PY (a) = Cr [4 + ža 2) = [Pow] + yP- x) 
PRE) = Ta |e? +a- 2?) = POG) 
Pha (a) = Cr [He = PW (z) (2.33) 
PO (x) Ca | — el 2) 
| 11C4 RC 2) = [Pow] ce yM5(1— x). 


Here zx is the splitting parameter, essentially governing the light-cone momentum frac- 
tion of the offspring with respect to its emitter. It is worth noting that the indices 
of both Py, and of Ppa are related to a parton splitting process a — bc, where the 
type c of the third parton is fixed by a and b. The P\)(z) are also called regularized 
splitting functions to order o. 

A pictorial way to construct the anomalous dimensions is to analyse the splitting 
functions and to realize that they can be written as the sum of a part that diverges 
for z — 1 and some finite remainders which are linked to the 6-function from the + 
prescription. This link is given by sum rules, namely 


> j dzPi;(z) = 0 
#20 
IEE = 0 


1 
[aePaal2) = 0. 
0 


Here the first sum rule satisfies the condition that the splitting functions can be in- 
terpreted as probability densitiies for a splitting to take place, while the second and 
third one ensure flavour and momentum conservation. 

However, it should be stressed here that, strictly speaking, the picture above, 
called collinear factorization, which factorizes cross-sections of processes involving 
hadrons in the initial state into process-independent PDFs times the matrix element 
with quasi-free partons in the initial state, has been proved only for the cases of deep- 
inelastic scattering and Drell-Yan production [143, 409, 410]. The fact that the same 
ideas and the same formalism are also employed for other processes is in fact justified 
only by the success of this description rather than on a strict mathematical proof. In 
fact, recent work hints at the existence of rather subtle, factorization-breaking effects 
at higher orders [843], which are beyond the scope of this book. 
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2.1.4 Hadrons in the final state: The full picture 
2.1.4.1 Partons in the final state 


Similar to the parton distribution functions f,/p, (2, Q?), which at leading order encode 
the probability to find at scale Q? the parton p in the hadron h with a light-cone 
momentum fraction x with respect to the hadron, the fragmentation functions 
(FFs) D,/n(x, Q?) yield, again at leading order, the probabilities of finding the hadron 
or photon h emerging from parton p, at scale Q? and carrying the fraction x of the 
parton’s light-cone momentum. Similar to the PDFs, the FFs typically cannot be 
calculated but must be measured, and to complete the analogy, they enjoy the same 
kind of evolution equation, cf. Eq. (2.31), as the PDFs. 

Starting at some scale with a parton with some momentum, the DGLAP equation 
therefore describes, how, by successive emissions, momentum is carried away by the 
secondary partons, until the scale of where the hadron is measured is reached. This 
is, pictorially speaking, the inverse of the PDF situation, where, starting from a low 
scale, a parton constituent of a hadron with some momentum fraction evolves to the 
larger scale of a relatively harder scattering process via multiple emissions of secondary 
partons. 

Practically speaking there is yet another difference: in the case of the PDFs and the 
DGLAP equation for them, which is related to the evolution of initial-state partons 
from a known and fixed incident hadron, the final state is in many cases far less defined. 
In fact, the FFs come into play only, if the process signature is sensitive to finding a 
specific hadron in the final state — a situation in which the respective cross-sections 
are often denoted as single-hadron inclusive cross-sections. If, on the other hand, the 
presence of specific hadron species does not matter, then the FFs are typically not 
relevant. In such a situation, however, the DGLAP equations can still be used in 
order to describe the evolution of the partonic ensemble down to scales which are low 
enough for the perturbative description based on partons to cease being useful. The 
emergence of the then relevant degrees of freedom, the hadrons, however, cannot be 
treated from first principles and will rely on modelling with various levels of theoretical 
input. 


2.1.4.2 Emissions in QCD 


Compared to QED there is an important difference in the radiation pattern in QCD, 
driven by the fact that the gluons themselves carry colour charges, allowing them to 
radiate further gluons. Inspecting the form of the splitting function, Eq. (2.33), reveals 
that the emission of secondary gluons off gluons exhibits divergent structures in the 
splitting parameter z, namely 1/z and 1/(1 — z). It is in remarkable contrast to the 
finite behaviour of the splitting function for gluons or photons splitting into fermion 
pairs. This additional channel thus drives a further acceleration of radiation beyond 
the effect of the larger coupling in QCD compared to QED. From a theoretical point 
of view, ignoring the finite splitting of bosons into fermion pairs, the colour of the 
gluons means that the simple eikonal picture of iterating independent emissions in 
QED will not directly translate to QCD. While in QED, the eikonal for any photon 
emission is always spanned by the muons acting as eikonal partners p and p’, and 
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Fig. 2.7 Some Feynman diagrams for the emission of multiple gluons by 


a quark pair contrasted with corresponding leading colour diagrams. 


emissions of photons by photons are absent in the QED case, in QCD the gluons carry 
colour and thus can also act as gluon emitters. Then, after each gluon emission the 
structure of the eikonals will change, as the emitted gluons introduce new directions 
of the colour field. In the limit of infinite colours, also known as the large-N, limit, 
this leads to the notion of colour dipoles emitting the gluons. The emergent more 
complicated radiation pattern can be represented by some colour-flow diagrams of the 
type depicted in Fig. 2.7. 
The leading behaviour of the QCD emission process follows a pattern given by 


2 2 2 
dwt? = Gatki) CF an h + (1 = A | 


2m k? w (2.34) 
o a(k?) n dk? 1+2 _ a(k?) 2 dk? (1) l 
= Oe Cr ke dz = = on Cr kZ dz Pig (z) . 


for gluon emission off a quark with energy E. In the soft limit, for vanishing gluon en- 
ergies w = E(1 — z), this reproduces the dipole form in Eq. (2.26), which qualitatively 
is defined by a logarithmic distribution in both the gluon energy w and its transverse 
momentum kı with respect to the quark direction. They are often denoted as trans- 
verse logarithms, related to the collinear divergence or mass singularity for the 
logarithms related to the 1/k? term and as longitudinal logarithms related to the 
soft divergence or infrared singularity stemming from the 1/w part of Eq. (2.34).4 


4This automatically also leads to a qualification of parton emissions: for the production of a 
“jet”, a parton needs to be emitted at relatively large angles and energies, k], w ~ E, leading to 
dw! 49 ~ Cras(k? )/(27). Conversely, emissions not giving rise to additional jets are characterized 


by k1, w < E and therefore dw??49 ~ Cpas(k? )/(2m) log? E. 
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2.1.4.3 Interplay of final state radiation and hadronization 


As can be seen in Eqs. (2.26) and (2.34), the emission of massless final state partons 
leads to divergences of the form 1/k? and 1/w that must be cured. Typically, a cut on 
the phase space of resolvable emissions is applied, which in the case of photons could 
for instance be expressed by minimal photon energies and minimal opening angles 
with respect to its emitters. While in QED these cuts are often dictated by practical 
considerations, the technology and sensitivity of the detectors employed, the situation 
is different in QCD. There, a cut-off would typically be provided by the scale, at which 
the phase transition of the partons to the hadrons occurs. It has become customary to 
parameterize this scale through a cut on the relative transverse momentum k] of the 
emitted parton with respect to its emitters, and by the requirement that the minimal 
transverse momentum is sufficiently far above the Landau pole of QCD. Thus, such 
cuts are of the order of about p) x 1GeV > Agcp. This relatively rough estimate 
can be further refined by some simple estimates. 

Consider, again, the electromagnetic field accompanying an accelerated charge 
moving with velocity near the speed of light v ~ 1. While at asymptotically large 
times the electromagnetic fields will be the Lorentz-contracted discs of Fig. 2.1, right 
panel, it is also evident that this state would not just “jump” into existence, cf. [477], 
which also forms the base of the further reasoning below. 

Assuming the existence and acceleration of the charge started at t = 0 with v = 0, 
the field spreads out as a sphere with r’ < t’ in the (primed) rest frame of the charge. 
In the (unprimed) laboratory frame, of course, time dilation by a factor y = E/m 
the field at distances rı from the axis of movement of the charge will show up at the 
earliest at t = yt = Er /m. 

For quarks or other colour charges being produced at energy E in a hard interac- 
tion the situation is very similar. However, the reasoning there has far-reaching con- 
sequences: identifying with m the constituent mass of a quark, typically O (Agcp) © 
R! for light quarks and mg for heavy quarks, and with rı a typical hadronic size, 
R = O(1fm), immediately yields a hadronization time t‘**®) proportional to the 
energy of the quark: 

ER? for light quarks 
(had) J 
: ere for heavy quarks. ee 
mo 

Qualitatively, this naive picture emerges also in a quantum mechanical treatment. 
For hadrons of characteristic size R, the confining QCD forces are associated with 
gluon fields with typical momenta 


in the hadronic rest-frame. They emerge after typical times 


1 
ta Ra —, (2.37) 
ki 


which, including a Lorentz—boost factor as before leads to hadronization times 
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(2.38) 


where m is a typical hadronic mass scale; the result is the same as in the more classical 
treatment above. 

At the same time it is important to see how long it actually takes for a gluon to 
form in the emission off, say, a quark. Naively, this formation time can be estimated 
from the invariant mass of the quark—gluon pair and its energy as 


1 E E k kK 
Mqg Mag kEO, k? k? i 


pom) y (2.39) 


Here, the starting point is given by a combination of uncertainty principle and Lorentz 
time dilation effects, and in some intermediate steps a small opening angle of the 
resulting pair has been assumed. 

Comparing this with the hadronization time and demanding — reasonably — that 


gluons be formed before they hadronize, yields a constraint on the emission kinematics: 
EI g gom) < ghad) y fy 2 RSS ORA 2.40 
poar <t ~ Ry À L2 RT (few QcD) - (2. ) 


Extending this to the case of heavy quarks Q, such as top quarks, it is worth noting 
that in their case the lifetime given by 


3 

E E 

76% (=) < = m td (2.41) 
MQ Mq Mq 


is smaller than their hadronization time; they behave as truly free quarks. 


2.1.4.4 Hadronization 


At this low scale, QCD enters a new regime, where the strong interaction becomes so 
strong that the partons start feeling the effect of being bound together in hadronic 
states. As in the case of the initial state, the transition between the parton and the 
hadron description is beyond our current quantitative understanding. This, of course, 
is where the parameterization provided by the fragmentation functions, obtained from 
measurements, becomes important. There is, however, a significant difference with 
respect to the PDFs in the initial state. In the final state there may well be more 
than one hadron emerging from a parton; consequently the fragmentation functions 
describe the transition of the parton into the corresponding hadron plus any other 
hadrons. They are therefore mainly useful for describing the inclusive production of 
this one given hadron in a collision, ignoring all other hadrons which may eventually 
emerge as well. To reach a more inclusive picture, other methods are thus necessary, 
which will be introduced and discussed at later stages of this chapter and in Chapter 7. 
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Fig. 2.8 Scetch of a hard hadron-hadron collision at a hadron collider 
experiment, such as the LHC. It depicts the hard interaction as the central, 
red blob. The initial and final state particles experience initial and final 
state radiation, in blue and red, respectively, leading to a proliferation of 
a large number of secondaries. The underlying event — in purple — has 
other partons scattering and producing further activity through multiple 
secondary emissions. All emerging partons will at some point arrive at 
low scales, and hadronize, before the primary hadrons produced there will 
decay further (in green). 


2.1.5 Summary: space-time picture of hadronic collisions 


In the following, the different aspects important in hadronic collisions that have been 
discussed so far will be put into a wider, more coherent context. In order to supple- 
ment this discussion, in Fig. 2.8 an event at a hadron collider has been scetched. The 
following discussion is summarized in Fig. 2.9. 


2.1.5.1 Hard interaction 


Ordering this picture with the help of typical scales related to the momentum transfers 
at different stages of an individual event, the starting point is the hard interaction. 
Because, by definition, this is the hardest part of the event, the scales there are the 
largest, and this opens the possibility to describe it by fixed-order perturbation theory. 
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There, the relevant features of the event — the emergence and decay of heavy states 
such as gauge or Higgs bosons or top quarks or the occurrence of a number of hard 
QCD jets — can systematically be included. This is, at leading order and for small 
final state multiplicities, typically achieved by the well-known textbook methods of 
constructing the relevant Feynman diagrams, summing and squaring them by employ- 
ing completeness relations, convolution with the PDFs, and by finally integrating over 
the relevant phase space of the final state and over the momentum fractions of the 
initial-state partons with respect to the incoming hadrons. The last step is done with 
numerical methods, Monte Carlo integration, which is due to the non-analytical 
structure of the PDFs. For increasing multiplicities of final state particles, still at lead- 
ing order, the number of Feynman diagrams exhibits a faster than factorial growth, 
rendering the textbook methods prohibitively time-consuming, and numerical meth- 
ods must be employed throughout. For a summary of such methods, the reader is 
referred to Section 3.2.1. In order to improve the accuracy of such fixed order calcula- 
tions, of course higher perturbative orders must be included, which typically leads to 
additional problems due to the occurrence of ultraviolet and infrared divergences. For 
a more in-depth discussion of the overall mechanism of such fixed-order calculations 
the reader is referred to Section 2.2, and to Chapter 3, while in Chapter 4 specific 
issues related to different process will be detailed. The role of the PDFs, on the other 
hand, and how they are obtained from data, will be dwelt on in Chapter 6. 


2.1.5.2 Secondary emissions 


Having produced a number of particles at large scales, they will undergo some final 
state radiation, emitting Bremsstrahlung-quanta at lower scales. At the same time, 
the incident partons will experience initial-state radiation, due to the breakdown of 
the coherence of the Fock states related to the incident particles induced by the hard 
interaction. As can be seen, for example in Eq. (2.2) for the case of QED, these extra 
emissions may lead to logarithmically enhanced contributions, where parametrically 
large logarithms due to the kı integration compensate, at least partially, the small 
coupling. This leads to the rough, schematic distinction of two different regimes of 
parton emission: 


e a regime of jet production, where ky ~ ky ~ w and emission probabilities scale 
like w ~ as(k1) < 1; and 

e a regime of jet evolution, where k} < ky ~ w and therefore emission probabili- 
ties scale like w ~ ag(k) log? k? © 1. 


This implies that in this jet evolution regime, typical fixed-order counting of coupling 
factors alone ceases to be sensible, and it becomes more important to resum terms of 
the form a” log?”, a” log”, and so on. For an introduction into resummation the 
reader is referred to Section 2.3 and to Chapter 5 for more details. This additional 
radiation is particularly important for QCD particles, because, first of all, the strong 
coupling is larger than the electromagnetic one, but also because, in contrast to the 
QED case, the gluons are coloured and thus emit secondary gluons. Such an enhanced 
radiation manifests itself in the emergence of jets, which turn out to be the relevant 
objects to discuss QCD final states. In simulation programs, also known as event 
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generators, the resummation of these logarithms and the resulting jet structure in 
typical QCD events is achieved numerically invoking parton showers, cf. the second 
half of Chapter 5. 

Taken together, the logarithmically enhanced radiation pattern may have some 
sizable impact on the structure of the events, as it may very well change the kinematical 
distribution of particles in the final state. In addition, it should not be forgotten here 
that the experimentally observable objects are hadrons rather than the quanta of 
QCD, quarks and gluons, which introduces yet another layer of complication when 
discussing the full structure of hard collision events. 


2.1.5.3 Hadronization and hadron decays 


At the low scales of O (1GeV’), where the perturbative description breaks down, 
the impact of the transition to hadrons becomes important for a number of observ- 
ables. There are various ways of how to approach this problem. By now the most 
often used method is to rely on Monte Carlo-event generators which, aiming at a 
complete simulation of particle collisions including all facets, typically provide a phe- 
nomenological model that translates the partons into hadrons. These models usually 
contain about a dozen parameters which need to be adjusted (“tuned”) to existing 
data. In the construction of these models, various almost always rather qualitative 
ideas of how the phase transition proceeds are turned into some algorithmic pre- 
scription. Such models include independent fragmentation [137, 527, 636], sta- 
tistical hadronization [207], the string model [162, 163, 172, 281], and the cluster 
model [528, 528, 533, 896, 899], based on the idea of preconfinement [151]. 

Alternatively, analytical or semi-analytical descriptions are sometimes employed, 
which are often based on the analytical continuation of the strong coupling, as into 
the non-perturbative regime and on employing the kinematics of perturbative parton 
splitting also in this infrared regime. Examples include power corrections [481, 483, 
691, 887] and have mainly been employed in the description of hadronization effects 
in electron—positron annihilations into hadrons and deep-inelastic scattering. 

In many cases, the hadrons that emerge in the hadronization process are unstable 
and decay further, leading to an additional proliferation of particles in the final state. 
This is typically dealt with in event generators, where, typically, for lighter and better- 
known hadrons such as the 7, 7’, p’s, w, or ¢ hadrons, the corresponding branching 
ratios can be taken from the experimental results published in the PDG, subject to 
some rescaling to ensure that the branching ratios add up to exactly unity. For heavier 
or less well-known states this, of course, ceases to be an option, and additional decay 
channels must be invented and branching ratios must be fixed by considerations in- 
volving, among others, phase-space or flavour-symmetry arguments. The kinematical 
distributions of the final state particles are quite often fixed by assuming isotropic 
decays — this may be a poor approximation. Therefore, in some cases matrix ele- 
ments from effective theories such as (Resonance) Chiral Perturbation Theory 
(xPT) [494, 495] or from spin consideration only are invoked. At this point it is 
worth noting that the hadronic decays of the 7-lepton are treated in a similar fash- 
ion, with matrix elements either stemming from RyPT [491, 492, 844] or from the 
Kühn-Santamaria model [703]. 
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For the decays of hadrons with heavy quarks, quite often only those final states are 
known, which involve a few final state particles only, which then can be taken, again, 
from the PDG. There, kinematical distributions typically are taken from suitably 
chosen matrix elements and often involve additional form factors, both obtained from 
Heavy Quark Effective Theory (HQET) [497, 640, 745, 784, 813, 847, 848]. For 
high-multiplicity final states (which in the case of, say, the B-mesons amount to about 
50% of the total decay rate), often the decay is treated by going back to the parton 
level. In these cases, the parton-level matrix elements are supplemented with parton 
showers and hadronization. For a more detailed description, the reader is referred to 
Chapter 7. 


2.1.5.4 Underlying event 


An additional complication, when discussing collisions with hadron initial states, arises 
from the fact that hadrons are extended objects, which are composed from a multitude 
of partons. In standard textbook formalism, this is reflected by employing PDFs to ac- 
count for the transition of the incident hadrons into the partons which then experience 
a hard scattering. However, this formalism does not account for the possibility that 
more than one parton pair coming from the two hadrons may interact. This effect, 
also known as double-parton scattering or multi-parton interactions, in fact 
is beyond standard factorization; in the absence of a first-principles approach it thus 
must be modelled. Up to today, even the simplest models, which are based on a sim- 
ple factorized ansatz of merely multiplying the partonic cross-sections and including 
a trivial symmetry factor, appear to be in agreement with data. This suggests that, 
possibly, probable correlation effects when going from one- to two-particle PDFs or 
non-trivial final state interactions are not dominant and can be treated as some kind 
of higher-order correction to the simple picture. 

Apart from such hard double or multiple interactions there are other, softer addi- 
tional interactions when hadrons collide — some of them will just fall into the category 
of multiple-parton interactions at lower scales, where no hard objects like jets with 
some tens of GeV in transverse momentum are produced. In addition, there is another 
contributor to the overall particle multiplicity, namely the remnants of the incident 
hadrons. After one or more partons have been extracted from them, the parton en- 
semble forming them typically cannot combine back into a single colourless object but 
instead must hadronize into a set of hadrons. Quite often, however, the two beam 
remnants appear to be connected in colour-space which must be included into the 
reasoning. In addition, although the partons are by far and large moving in parallel 
to the beam, i.e., to the hadron they form, there is no reason to assume that they do 
not have some transverse momentum of the order of Agcp up to some few GeV. This 
intrinsic transverse momentum of the beam-remnant partons guarantees that the 
hadrons stemming from their hadronization do not necessarily vanish along the beam 
pipe, but can instead reach the detector due to their finite transverse momentum. 

For a further discussion of current models, the reader is referred to Chapter 7; 
some comparisons to data are presented in Chapters 8 and 9. 
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2.1.5.5 Pile-up 


Another important effect in hadron collisions, especially at the LHC, is the interaction 
of multiple pairs of protons in the same bunch crossing, the so-called pile-up. This 
is driven by the instantaneous luminosity of the collider experiment in question, and 
while at maximal luminosity at the TEVATRON there were about 5 proton—anti-proton 
interactions per bunch crossing, at the LHC about 25 pairs of protons are expected 
to interact in each bunch crossing at design luminosity and the centre-of-mass energy 
Ecm. = 14 TeV. For a possible SLHC, this number would go up to about 250 proton- 
proton collisions per bunch crossing. In addition to that, the size of the detector allows 
for the possibility that the particles produced in more than one bunch crossing interact 
with different parts of the detector at the same time, an effect known as temporal 
pile-up. At the LHC, with the anticipated maximal frequency of 40 MHz, there will 
be about three such generations of particles at the same time in the detector. 


2.1.6 Definition of physical objects: leptons, photons, and jets 


Before turning to calculating the first example cross-sections in perturbation theory 
and analysing their behaviour, especially at higher order in QCD, in this section proper 
definitions of relevant physical objects will be discussed. 


2.1.6.1 Leptons 


While naively this discussion may seem a bit awkward, it is important to understand 
why this is a relevant issue. To gain some understanding, consider first an example in 
electrodynamics, namely the production of Z bosons and their subsequent decay to 
a lepton pair. Without QED final-state radiation, and in an ideal world with perfect 
detectors, the invariant mass of the lepton pair would roughly follow the Breit-Wigner 
form expected from resonance production, of course modulated by PDF effects. Their 
combined four-momentum, due to energy conservation, would of course be identical to 
the intermediate boson’s state. In turn its transverse momentum and rapidity could 
be directly reconstructed. This, however, is not how reality presents itself. Instead 
the lepton will emit photons that follow, at leading order, the eikonal pattern already 
discussed in Section 2.1.1, resulting in a logarithmic enhancement of soft and collinear 
emissions. Consequently, the lepton four-momentum is reduced and, even for ideal de- 
tectors, their reconstructed kinematics would typically differ from those in the absence 
of such QED FSR effects. There are various strategies to deal with this apparent prob- 
lem. Of course it is possible to take such effects into account by including them in the 
calculation. A popular way of achieving this is employed in some modern Monte Carlo 
simulations. It is based on the algorithm of Yennie, Frautschi, and Suura [901] and 
allows for a resummation of leading logarithms in an energy cut-off and an angular 
cut-off, which can be systematically improved by fixed-order calculations. Since this 
algorithm also provides a transparent way of guaranteeing four-momentum conserva- 
tion, it is particularly well-suited to describing the kinematic effects on leptons due to 
photon emissions in both the initial and the final state. Emissions below the cut-offs 


5For some specific realizations of such infrared-safe phase space mappings between states before 
and after QED radiation, see for instance [838]. 
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Detector Event 


Hadron-hadron collision 


Parton-parton collision 


hardest interaction: the “signal process” 
description: fixed-order perturbation theory (Chapters 3 and 4) 
important input: PDFs, factorization theorem (Chapter 6) 


Initial- & final-state radiation 


emission of secondary particles 

description: resummation or parton shower (Chapter 5) 
important: matching to fixed order & logarithmic precision 

note: final state radiation is perturbative part of “fragmentation” 


Underlying event 


multiple parton-parton interactions 

description: 2 — 2 parton-parton scatterings in perturbation theory 
or other QCD-inspired models (Chapter 7) 

treated only in full simulations, comes with further parton showering 


Hadronization 


transition of partons to primordial hadrons 
description: fragmentation functions in calculations 
non-perturbative models in simulations (Chapter 7 


Hadron decays 


decay cascades of the primordial hadrons 
description: data, effective theories, symmetries, models (Chapter 7) 


multiple hadron—hadron collisions, typically soft. 
description: models, often QCD-inspired (Chapter 7) 


Fig. 2.9 The various components of a hadronic collision event as seen 
in the experiment. Typically, in one collision, a number of hadron-hadron 
collisions occur, most of which are soft and as “pile-up” blur the picture of 
the one hadronic collision of interest. This usually has a hard component, 
the partonic “signal event”, which is calculable from first principles in per- 
turbation theory. The interacting particles undergo initial and final state 
radiation, before the partons hadronize. The emerging primordial hadrons 
then decay further to the particles finally visible in the detector. 
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are considered to be too soft or too collinear to yield any visible effect; conversely, such 
photons can practically be taken as recombining with the original lepton. However the 
singularities related to their emission cancel the corresponding soft and collinear di- 
vergences in the virtual contributions, to all orders. This is yet another manifestation 
of the KLN and BN theorems, cf. Section 2.2.5. 

On the other hand, the production of leptons, especially in hadronic collisions, may 
essentially proceed through two mechanisms. Direct production, the case discussed up 
to now, is where the leptons directly stem from the hard interactions. However, lep- 
tons may also emerge in the decay of hadrons and, in particular, in the weak decay of 
heavy hadrons containing charm or bottom quarks. In fact the leptons there play an 
important role in the identification of such objects, with the finite lifetime of weakly- 
decaying heavy particles resulting in displaced vertices measurably different from 
the primary one. However this production channel may also mimic direct production 
and its characteristics. Typically the production cross-sections of heavy flavours con- 
voluted with corresponding decay branching ratios are orders of magnitude larger than 
the cross-section for direct lepton production, implying that this is an issue that must 
be dealt with carefully. A straightforward solution relies on two considerations. First, 
weak decays of heavy hadrons producing the leptons also yield other, lighter hadrons. 
In addition, more often than not, the heavy flavour is part of a larger system containing 
more hadrons — a jet — that will be discussed further shortly. Especially for leptons 
with a transverse momentum larger than the mass of typical heavy hadrons, these 
two effects mean that the lepton more often than not is part of a roughly collimated 
bunch of other particles, mostly hadrons. As a consequence, demanding leptons to be 
isolated in rapidity and transverse angle from any hadronic activity will significantly 
reduce the impact on an analysis of leptons produced in hadronic decays. This isola- 
tion requirement is typically realized experimentally by demanding that the sum of 
the hadronic energy or the transverse momentum of other charged tracks in a radius 
Reyit around the lepton be smaller than a critical value. Here, the distance AR;; 
between two objects 7 and j is given by 


AR = An}, + Ag; = (yj = mi)? + (6) - 08)? (2.42) 


with 7 and ¢ the pseudorapidity and the azimuthal angle, respectively. 

At this point some terminology needs to be defined. The lepton four-vectors pro- 
vided by a fixed-order parton-level program are termed to be at the Born level. A 
bare or undressed lepton is one that has undergone QED radiation, it has lost en- 
ergy due to photon emission. A large fraction of that photon radiation is relatively 
collinear with the lepton direction (corresponding to about 3% of the lepton energy 
carried away by radiation off of electrons and 1.5% for radiation off of muons) and a 
smaller fraction is emitted at wider angles.” 

This is highlighted in Fig. 2.10, which demonstrates the effect of photon radiation 
on the lepton in the decay of a W boson. The emission of a single photon depletes 


6 Of course the term Born level here refers to Born level with respect to QED corrections, so leptons 
in higher-order QCD calculations would still be coined as Born level in this sense. 


"Note that the QED corrections are strongly kinematic and phase-space-dependent. 
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electrons 


average E/E, [%] 


Fig. 2.10 The effect of photon radiation on the average lepton energy in 


W boson decay, as a function of the angular separation between the photon 
and the lepton. 


the energy of the lepton (measured in the rest frame of the decaying W boson) by a 
fraction that depends on the angular separation of the photon and the lepton (AR,,). 
This fraction is larger for electrons than muons due to the collinear enhancement that 
goes with the lepton mass (mz) as log(me/my ). For nearly-collinear photons, in the 
region AR < 0.1, the fraction of energy radiated away is around 2.5% for electrons 
and 0.7% for muons. Emission at wider angles is responsible for approximately 1% of 
additional depletion in both cases. 

In an experimental lepton reconstruction algorithm, photon radiation is absorbed 
inside a cone of small radius around the lepton, typically AR < 0.1 using the definition 
in Eq. (2.42). The resulting lepton four-vector, that has been corrected for collinear 
photon radiation effects by such an algorithm, is designated as a dressed lepton. 
There is still a residual correction of around 1% (for both electrons and muons) to 
account for wide angle photon emission. Often, data and theory comparisons are made 
at the dressed lepton level, if the theoretical calculation derives from a Monte Carlo 
program, which is capable of including photon radiation effects. For comparison to 


fixed-order calculations, the data typically is corrected for QED radiation to the Born 
level. 


2.1.6.2 Photons 


Similar considerations also hold true when considering the production of photons. 
Once again, the dominant contributions stem from secondary mechanisms of photon 


Physical picture of hadronic interactions 41 


production rather than direct production in the hard interaction. Such secondary 
photons can be emitted as final-state radiation off quarks or, more often, originate 
from the decay of hadrons. In the latter case the major source of concern is not tied 
to the production and decay of heavy flavours, which typically gives rise to other, 
secondary hadrons in addition to charged leptons, but the annihilation of neutral, 
mostly light mesons in processes such as 7° — yy or n — yy. These particles are 
created in abundance in hadronic final states and, since both they and their decay 
products are neutral, they can only be seen calorimetrically. Because they are so light, 
their decay products more often than not end up in the same calorimeter cells when 
looking for high-p, objects, which makes it fairly tricky to disentangle them from 
single photons. This is particularly relevant for signals containing primary photons, 
which are directly produced in the hard process and thereby have large backgrounds 
from such secondary production channels. This adds another level of complication with 
respect to the lepton case, but the overall solution remains the same. 

Again, in order to disentangle direct (or prompt) photons from those emerging 
in the fragmentation or decay of strongly interacting particles, isolation criteria are 
introduced. In past years, it has become customary to use the Frixione isolation 
criterion [540] in theoretical calculations. This criterion is defined by a cone with 
opening angle ĉo around the photon in 7-@ space, a critical exponent n and a scaling 
factor €}. Photons are considered isolated from the hadronic environment, if the 
accumulated hadronic transverse energy inside any cone with size 6 < ôo, weighted by 
the distance from the photon, is smaller than the photon’s transverse energy multiplied 
with a cone-size-dependent weight: 


1 — cos ô 


X 1.1008 = AR.) <e Bis | | V5 < ðo. (2.43) 


1 — cos ô 
i€hadrons D 


It is worth noting that the modifying factor on the right-hand side of the equation 
above enjoys the property that 


, | 1 — cos ĝ i 
lim | ————_ | =0. (2.44) 
60 | 1 — cos do 

This guarantees that in the strictly collinear limit any hadronic emission will lead 
to the photon being considered as not isolated. At the same time, the scaling with 
energies on both sides ensures that the case of partons splitting into partons in some 
final-state radiation process does not hamper the isolation criterion too badly — it will 
remain sufficiently infrared-safe. 

From the theoretical perspective, the use of the Frixione prescription greatly sim- 
plifies any calculation. It removes the need for including non-perturbative photon 
fragmentation contributions that describe the parton to photon transition, cf. Sec- 
tion 2.1.4. Since these fragmentation contributions are a purely collinear phenomenon, 
they are explicitly removed when using the Frixione approach. However, experimen- 


8Customary choices for the parameters of this algorithm are, for instance, €y = 1l, n = 1 and 
ôo & 0.5, mimicking the typical cone-size of jets, see the section about jets, 2.1.6.3. 
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talists have typically not used the Frixione approach for photon isolation either at 
the TEVATRON or at the LHC, not least because the condition in Eq. (2.43) cannot be 
directly implemented for a detector with finite granularity. Instead an isolation cone 
is defined, typically of radius R=0.4, and any transverse momentum contained in the 
isolation cone (not assigned to the photon) is required to be less than either a fixed 
value, or a percentage of the photon tranverse momentum. The isolation definition is 
designed to remove most of the jet fragmentation background to photons, while having 
a high efficiency for retaining true, isolated photons. While the primary purpose of the 
isolation algorithm is to remove the jet backgrounds, it also has the effect of effectively 
removing much of the photon production from fragmentation processes. 

Another procedure, also known as a democratic approach, is based on treating 
photons as if they were jets and applying a jet clustering algorithm on all outgoing 
particles. In this method a photon is isolated, if it would be forming a jet and if its 
hadronic content contributes less than a critical value to its overall energy or transverse 
momentum. In the next section, such jet finding and clustering algorithms will be 
discussed in more detail. 

Ultimately both algorithms can be used in parton-level calculations and in hadron- 
level simulations or measurements, facilitating a direct comparison between the two. 
In reality the energy in the isolation region around any photon candidate is dominated 
by underlying event energy, and at high luminosities, by pile-up. There are techniques 
to effectively subtract this energy that is either completely (in the case of pile-up), 
or partially (in the case of the underlying event energy) uncorrelated with the hard 
scatter, see [586]. However, the stochastic uncertainties in the subtraction tend to over- 
whelm any fragmentation energy that may be in the cone. Given these uncertainties 
it is therefore not unreasonable to use Frixione isolation in theoretical calculations for 
comparison to data on which typical experimental isolation cuts have been applied. 


2.1.6.3 Jet algorithms: general considerations 


The situation becomes even more complicated when trying to analyse the hadronic 
components of final states. As already discussed in Section 2.1.4, strongly interacting 
particles, quarks and gluons, experience significant final state radiation, driven by the 
large logarithms related to the usual soft and collinear divergences when emitting 
massless quanta. The radiation is further enhanced by the size of the strong coupling, 
which is an order of magnitude larger than the electromagnetic one. This last effect is 
even more pronounced for emissions at relatively small scales, due to the faster running 
of the strong coupling. The resulting ensemble of QCD quanta fragments into hadrons, 
many of which are unstable and decay. This leads to a massive proliferation of hadrons, 
which, however, tend to “clump” into fairly energetic clusters, called jets. While this 
picture is very intuitive qualitatively, a more quantitative way of dealing with this 
phenomenon is necessary. One obvious requirement is that any reliable solution must 
allow the direct comparison of perturbative calculations on the parton level, based on 
first principles, with the experimental reality. Thus quantitative descriptions, called 
jet algorithms, must be applicable at the level of partons, observable hadrons, and 
even energy deposits in calorimeter cells. In addition, it must be guaranteed that the 
chosen jet algorithm holds water irrespective of the actual perturbative order. Simply 
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put: considering the same observable at a higher perturbative order typically involves 
additional parton emissions, which actually may be soft and/or collinear with respect 
to already present partons. These additional and potentially unresolved emissions must 
not lead to unwanted artifacts such as ambiguities in the actual number of jets observed 
or their position in phase space. If this requirement is satisfied, the jet algorithm is 
called infrared-safe. 

In this section, the focus will be on jet definitions and algorithms applied at the 
perturbative level, i.e., using partons. Some of the difficulties that result when hadrons 
are clustered to jets instead will be discussed in Chapter 8. 

At leading order, each jet is modelled by a single quark or gluon. As already noted 
in Section 2.2, in general this leads to infrared singularities corresponding to kinematic 
configurations in which two partons are collinear, or a gluon is soft. It is only after 
the application of a jet algorithm, which ensures that all partons are both sufficiently 
hard and well-separated, that any sensible theoretical prediction can be made. 

In general, jet algorithms can be considered as constructed from two ingredients. 
First, all objects belonging to a jet must be identified, for which there are essentially 
two categories of algorithm. One is based on purely geometric considerations and pro- 
ceeds by identifying a jet axis and assigning all objects within a radius Ro around this 
axis to the jet. In contrast, sequential algorithms proceed by combining and clustering 
pairs of particles in turn until only hard quasi-particles identified with jets are left. 
Second, the momenta of the jet constituents must be combined to yield the overall jet 
momentum, which in modern algorithms is realized by recombination schemes acting 
sequentially on four-momenta. These two stages in principle are independent, but in 
sequential clustering algorithms they are, to a certain degree, intertwined. 

It is crucial to describe this construction of jets from their constituents in terms of 
an appropriate set of kinematic variables. The standard set consists of the transverse 
momentum (p1 ), rapidity (y), azimuthal angle (¢), and invariant mass (m) of the jet.? 


2.1.6.4 Cone algorithms 


Cone algorithms are the prime example for assigning objects to jets on geometric 
grounds and typically work as follows. In a first step, a seed is found, defining the 
direction of the prospective jet centre or centroid (ne, $c) in the n-¢ plane.!? For 
instance, such a seed could be any particle in the event — an idea typically used in 
parton-level calculations — or any calorimeter cell with an energy deposit larger than 
some critical energy, an idea usually used for hadron- or detector-level analyses. Then 
a circle with radius Ro is drawn around the centre and all particles i with radius 
Ric < Ro are assigned as members the prospective jet. Its overall momentum can 
then be defined as the vectorial sum of the four-momenta of all its constituents. Such 
candidate jets are then accepted if their transverse momentum is larger than some 
critical value, pË > Pi crit- As a by-product, of course, the jet position (n, 6), 
its mass, etc., can readily be evaluated. 


9Early studies of jets at the TEVATRON did not employ this set, for example using Er rather than 
pr, and 7 rather than y, and largely ignoring the information provided by the jet mass. 

10In past usage, cone jet algorithms tended to use pseudo-rapidity (7) rather than rapidity (y). 
Modern jet algorithms all use y. 
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Fig. 2.11 A kinematic configuration demonstrating the lack of a proper 
all-orders definition of the simplest cone algorithm.The addition of a soft 
parton acting as a potential jet seed between the two original cones “pulls” 
the hard partons in, resulting in one rather than two jets. This is a typi- 
cal example for infrared-safety problems in perturbation theory. Reprinted 
with permission from Ref. [508]. 


This is a very simple algorithm but, like many nice ideas, it is rather too simplistic 
and it cannot be used to describe the full wealth of features predicted in QCD without 
introducing further complications. A first problem arises when trying to find more 
than one jet. Depending on the parameters of the collision and the cone algorithm, it 
is not unlikely that in a multijet environment jets may overlap, leading to particles 
that may in principle contribute to more than one jet. Such assignments would most 
certainly lead to unwanted features like non-conservation of momentum and therefore 
must be avoided. A simple way of doing this is by ensuring that, for instance, the 
harder jet includes the particles in the overlap zone. So, this problem can to some 
degree be solved algorithmically. 

More complications arise when considering seeded cone algorithms. As an example 
at the parton level, consider the situation depicted in Fig. 2.11, where two hard partons 
i and j have a relative distance Ro < Rij < 2Ro. As long as these are the only two 
partons in the vicinity, they will form two separate jets since Rij > Ro, however, the 
addition of a soft gluon may completely change the picture. Using it as a potential 
seed it is entirely possible that both partons have a distance smaller than Ro, thus 
being sucked into one combined jet, which has a larger total energy and momentum 
and will therefore be accepted as a candidate. So, basically, the presence of additional 
radiation at higher orders allows new seed directions about which jets can be formed. 
The appearance of such new jet axes, especially if it is due to the presence of arbitrarily 
soft radiation, will cause real and virtual corrections to fall into bins of different jet 
multiplicity. This will quite often hamper the mutual cancellation of the associated soft 
singularities and in such cases will invalidate the perturbative calculation. In general, 
this kind of breakdown is termed a problem with infrared-safety. The consequence of 
this is that although simple cone algorithms are perfectly feasible at the hadron level 
they have no relevant theoretical interpretation in terms of perturbation theory. This 
renders the comparison of theory and experiment, using these algorithms, seriously 
flawed at best. 
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A remedy to this problem of the simple cone algorithm above is to introduce addi- 
tional seed directions throughout; this was the philosophy underlying the introduction 
of the “Midpoint cone” algorithm. In this algorithm, extra seeds are placed between 
every pair of stable cones having a separation of less than 2Ro, twice the size of the 
clustering cones. However, a careful analysis reveals that such solutions only post- 
pone the infrared-safety problem to some higher order in perturbation theory, which 
of course means that they are not very satisfying. Considering all possible directions of 
jet cones would eliminate any infrared-safety problems to all orders; but while this is 
feasible for a low-multiplicity parton final state, it becomes computationally expensive 
for true experimental data, or indeed parton shower predictions, tamed only by the 
granularity of real-world detectors. In summary, this problem of infrared-safety ren- 
ders “seeded” cone-algorithms in principle problematic for any meaningful comparison 
between theoretical calculations and experimental data. In practice, however, a careful 
analysis of the midpoint and SISCone algorithms (see below), as applied to jet physics 
at the TEVATRON, shows only marginal differences between the two. This ultimately 
motivates the use of TEVATRON data with jets defined through the midpoint algorithm 
which was the standard there. 

A practical solution maintaining the idea of perfectly cone-sized jets was found by 
abandoning the idea of seeds and replacing them with innovative geometrical methods 
to reduce the computational complexity [836] and resulted in the Seedless Infrared- 
Safe Cone (SISCone) algorithm. This algorithm suffers none of the limitations of 
its forebears but retains a relatively intuitive physical picture. However, in the first 
years of data-taking at the LHC another class of jet finders, the kr-algorithms became 
the standard tool. 


2.1.6.5 kr algorithms 


The kr family of algorithms [304, 345-347, 480, 511, 898] defines jets through a pro- 
cedure that follows an idea rather different from the one employed in simple cone 
algorithms. The latter use a predefined, regular shape — the cone — to capture the 
relevant objects in the event, either partons, hadrons, or calorimeter entries. The kr 
algorithm instead uses these basic objects as inputs from which to build jets in a 
recursive, iterative way. When these kp-algorithms were originally proposed in [345— 
347, 511], they were constructed in such a way as to follow the natural pattern of QCD 
radiation. Such a procedure may of course lead to jets with fairly irregular shapes. With 
the introduction of a less well motivated clustering algorithm in form of the anti-kr 
algorithm [304], this possible obstacle was overcome and the kr-type of jet algorithms 
was established. 

To cluster the objects in ky-algorithms, it is necessary to introduce a generalized 
distance measure in momentum space, namely 


p 


dip = (p1)? for each object i 
: Rij 
dij min { (pii)”? , (p13)? } 2 


Ro 


(2.45) 


Il 


for each pair of objects 2 and 7. 


The clustering now proceeds iteratively, where in each step the object(s) i (and j) 
with the smallest d are clustered, either with the beam, if dig is the smallest, or with 
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each other, if dij is the smallest. This is repeated until all distances d are larger than 
a critical value deut, which in turn defines the relative transverse momentum two jets 
have with respect to the beam or to each other. The generalized form of the algorithm 
discussed here is thus controlled by two parameters, one denoting the cone-like size of 
the jets (Ro), and one specifying the power to which the transverse momenta entering 
these expressions are raised (p). The Ro parameter here introduces a useful discrepancy 
in how relative transverse momenta impact on the jet definition: for typical values of 
Ro < 1 this implies in particular that the relative transverse momentum between two 
jets, which is given by 

pi = min {pri pis}? Riy (2.46) 
must be larger than their transverse momenta with respect to the beam, which ef- 
fectively translates into a different treatment of initial and final state QCD radiation 
when analysing the structure of the overall radiation pattern, where the former is more 
susceptible to relative transverse momenta with respect to the beam and the latter is 
more sensitive to the relative transverse momentum of two final state objects. 

The original kr algorithm [346, 511] corresponds to the choice p = 1 in the 
above criteria, and in this realization jets are clustered together in order of increasing 
transerse momenta. The Cambridge—Aachen (CA) algorithm [480, 898] uses p = 0 and 
therefore clusters in a sequence of proximity in 7-¢ space.'! In both specific imple- 
mentations jets tend to have a fairly irregular shape, which renders any subtraction 
of hadronic activity due to pile-up or to the underlying event a cumbersome task. In 
contrast, the variant in which p = —1, known as the anti-kr algorithm [304], results in 
fairly regular-shaped jets. Because of this feature, by now it has become the preferred 
choice in experimental analyses in recent years: it combines infrared-safety with fairly 
conical jets. 

A variety of jet sizes are commonly used with the anti-kr jet algorithm at the 
LHC, but unfortunately differ between ATLAS (0.4, 0.6) and CMs (0.5, 0.7). The tools 
now available, though, both for jet calibration and jet reconstruction, should allow for 
physics analyses to be carried out with multiple jet sizes and multiple jet algorithms. 


2.1.6.6 Choice of jet size 


Within an NLO calculation, a jet can consist of either one or two partons. The phase 
space in which two partons are included in the same jet is different for cone and kr 
algorithms using the same jet size parameter. Define d as the separation in 7 — ¢ 
space between two partons and z as the ratio of the transverse momentum of the 
softer parton over the harder parton. The (d,z) phase space is shown in Fig. 2.12. 
In Region I, the two partons are closer than the jet radius R; both partons will be 
included in the same jet for both a cone and a kr algorithm. Partons in Region II 


11Note that in their original form, both the kr [345] (or Durham) and the Cambridge—Aachen 
algorithm were defined for e~e+ annihilations into hadrons. In this realizationx, of course, Rij is 
replaced by cos @;;, Ro = 1, and the role of the transverse momenta in Eq. (2.45) is played by either 
the particles’ energies or the absolute value of their three-momenta. It is interesting to also note that 
for e~et+ annihilations into hadrons another jet definition based on a clustering according to the 
(pi +p;)?, the Jade-algorithm [204], was a popular choice. However, it turned out that this algorithm 
was inferior to the other two in its behaviour when mapping out the structure of QCD radiation and 
therefore providing a link between parton-level calculations and hadron-level data. 
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1.0 1.0 


Fig. 2.12 The phase space for the two partons that can form a jet in 
a NLO calculation, parametrized by z = pr.2/pr, (pr. > pr,2) and 


d= vin y2)? + (¢1 — $2)”. Reprinted with permission from Ref. [508]. 


will not be clustered into the same jet by either algorithm, while partons in Region 
II would nominally be clustered into the same jet with a cone jet algorithm, but not 
with a kr jet algorithm. Thus, a cone jet effectively has a larger catchment area than 
a kp jet with the same size parameter. In practice, with real data or Monte Carlo 
events, Region II is truncated for cone algorithms. This will be developed further in 
Chapter 8. 

Inclusive measurements involving jets at the TEVATRON tended to involve relatively 
large jet sizes, typically with jet radius R = 0.7, such that most of the energy of the 
jet is included within the jet radius [61, 86, 119, 122]. For complex final states, such 
as W +n jets, or tt production, it has been useful to use smaller jet sizes, in order to 
resolve the n-jet structure of the final state [59, 60, 81, 85, 88, 96, 99, 112]. 

Each jet size comes with its own benefits and drawbacks, both theoretically and ex- 
perimentally. A smaller jet size reduces the impact of pile-up and the underlying event, 
but fragmentation effects are inversely proportional to the jet size, thus becoming more 
important as the jet size decreases. In addition, as R decreases, terms proportional to 
log R start becoming important, requiring a resummation of such terms (not present 
in fixed-order calculations) for a precise prediction. 


2.1.6.7 Boosted objects and fat jets 


With the advent of the LHC, another question related to how to deal with jets has 
emerged. It refers to situations where heavy particles, such as gauge or Higgs bosons or 
the top quark, are produced at transverse momenta much larger than their rest mass. 
Their decay products will, in such circumstances, be fairly collimated. If these heavy 
objects decay hadronically this will thus result in jet-like structures, also known as 
fat jets. To disentangle such boosted objects from ordinary jets, a careful analysis 
of their structure is mandatory. The first ideas of how to achieve this were presented 
by Seymour [842] and have received renewed attention more than a decade later. This 
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was mainly driven by the work of Butterworth and collaborators [297], who showed 
that it may be possible to use such techniques to find a Higgs boson that decays to 
b-quarks at the LHC in gauge-boson associated production and thereby reviving an 
important search and measurement channel in Higgs boson physics. One of the key 
observations is that the angular separation of the decay products depends on the Higgs 
boson transverse momentum in a simple way. 

To see this, one can write the Higgs boson four-momentum as p = p; + pj, with 
p? = mê, and i and j the two decay products. Choosing a frame in which one of the 
decay products is directed along the x-axis, the momenta p; and pj are, 


i (cosh n, cos ¢, sin ¢, sinh n) , 


Š (2.47) 
Pj = PL (1, 1, 0, 0) , 


In this frame, 7 has rapidity 7 and azimuthal angle while j has zero rapidity and 
azimuthal angle. Therefore the angular separation of the decay products is given by, 


AR; = Vn +, (pitp;j)? = Qp'p) (cosh n — cos) . (2.48) 
If both 7 and ¢ are not too large, i.e. i and 7 are collimated, a series expansion yields 


(pi +p)? = pip) (1? +8 + O(n', 64) 
= pip’ (AR)? + O(n’, 6"). 


(2.49) 


In the limit in which 7 and j are highly collimated one can replace the four-momenta 
of the decay products by the Higgs boson four momentum scaled by an appropiate 
energy fraction, i.e. p; = zp and pj = (1 — z)p. From Eq. (2.49) it is then clear that, 


m3, 2(1 — 2)? (ARy)? , (2.50) 


and hence that, 
mH 


yz(= 2z)pı l 


This also shows that there is an intrinsic angular scale to such a configuration since the 
separation is bounded below, within the approximation, by the value 2m /p1. This 
observation paved the way for more detailed studies of boosted objects and jet sub- 
structure. Some of the recent ideas and studies that are of particular phenomenological 
importance are summarized in Refs. [106, 124, 144]. 

All jet algorithms and a number of jet substructure algorithms discussed in this 
section are incorporated in the FASTJET plugin [305]. 


2.2 Developing the formalism: W boson production at fixed order 
2.2.1 Factorization formula 


In collinear factorization introduced and motivated above, cross-sections for the hadronic 
production of an n-parton final state in a reaction of the type hiha => n+ X with 
hadrons hı and hə in the initial state can be written as 
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a,b 0 


1 
1 
= [Avastin fajhı (Za, HF) fojno (£0, UF) z fo IMab—nl?(@nj UF, HR) 
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1 
1 dza dx 
= Js > / 22 fans (Las HF) oshal (xb, uF) ) [oe EG Peers ba (® ni LF; LR) - 
0 


(2.52) 


A number of objects enters this master formula, namely: 


o The fan(x, u) denote the parton distribution functions (PDFs), which depend on 
the light-cone momenta fractions x parton a has with respect to the hadron h, 
which incorporates it, and on the factorization scale up.!? The x also connect the 
centre-of-mass energies squared of the hadronic collision E?,, (hih2)? = s with 
its partonic counterpart, E2 „ (ab) = § through 


$ = Lqkps. (2.53) 


At leading order, fa /p(x, p) can be interpreted as probabilities to find a in h with 
momentum fraction x at the (space-like)'? resolution scale  — an interpretation 
deeply connected to the case of deep-inelastic scattering, where factorization has 
been proven and where PDFs have traditionally been defined and measured. With 
this probabilistic interpretation at hand, the integral over the momentum fractions 
x is well understood. 

e The parton-level cross-section, déapn (ur, uR) which is given by the product 
of the incoming partonic flux 

1 Maro 1 1 


= 2.54 
4y/ (Pa Po)? — p2p 25 2XaXys m 


of the massless partons a and b, and the integral of the partonic transition ampli- 
tude squared, IM absn|?(®n; LUF, HR), over the available n-parton phase-space 
element, d®,,. 14 This phase-space element is given by 


aba = [I | Pen ate? -meo eosto tm- (258) 


12For some reminder of basic scattering kinematics, the reader is referred to Appendix A.3. There, 
the light-cone decomposition of momenta will be discussed. 

13Systems with total four-momentum squared Q? < 0 are called space-like, those with Q? > 0 are 
time-like, and those with Q? = 0 are called light-like. 

14Here and in the following, quantities at parton level are denoted by a circumflex (for example, 
&), while quantities at hadron level are left without it. 
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Each term in the square brackets is the Lorentz-invariant phase-space element of 
a particle i with momentum p; and mass m,;. The additional 6-function at the end 
implements total four-momentum conservation. 

e The factorization scale wr and the renormalization scale upg are process- 
dependent quantities. The former, up signifies the scale at which the parton dis- 
tribution functions are determined, while the latter, upr indicates the scale at 
which the coupling constants are evaluated, thus taking into account the quan- 
tum nature of the underlying theory. Typically, for simple processes which are 
characterized by one scale only, up and up are assumed identical to this scale. As 
an example for such a process, consider the case of W-production. On the parton 
level this is given by qq’ — W; since the incoming partons can safely be taken 
massless, the only meaningful partonic scale in this process is the centre-of-mass 
energy Ec.m. = V8 of the partonic system, which for an on-shell W boson is just 
the W-mass, my. However, for more complicated processes, and especially for 
those with multiple QCD emissions, picking a suitable scale is far from being 
trivial, cf. Section 2.2.6 for a further discussion. 


Some general considerations on these objects will be presented in the following sections, 
employing the classical example of the production of a single vector boson. 


2.2.2 Parton Distribution Functions 


Having discussed the basic idea behind the factorization formula in Section 2.1 and 
its formalization in the master formula Eq. (2.52) for the calculation of scattering 
cross-sections at hadron colliders in Section 2.2.1, it is now time to discuss in a bit 
more detail the structure of the parton distribution functions (PDF's). They act as the 
link between the incoming hadrons in such a scattering and the incident elementary 
quanta, the partons, which are employed to calculate the cross-sections. 


2.2.2.1 Valence quarks and simple sum rules 


Concentrating on the case of incident protons, one could naively assume that they 
consist of three valence quarks. If they did not interact, there would be no reason to 
assume why one of these valence quarks would be preferred over the others. Therefore 
their respective PDF's would be given by 


fup 42) =26 (2-3) 


Sajel%s p’) = ô (=- 5) G 


with all other PDFs being exactly zero. This would guarantee the flavour sum rule, 
given by 


(2.56) 


1 
fæ (sage p’) a. Taje(% w)| = 2 
0 
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dx [faal p’) aL lapel®s »°)| =1 


dz [fajpl®, u) Tees uw) = 0 for q € {s, c, b}, (2.57) 


oN OSes 


and the momentum sum rule, 


J; 


I dax X` fiyn(@, H’) =1 Yp? and for all hadrons h, (2.58) 
2 i 
where i runs over all partons, {u, ū, d, d, ..., g}. 


Switching on elastic interactions between the quarks, but still neglecting any emis- 
sion of additional quanta, would not change this picture dramatically. The only effect 
of such a form of interaction, which can be thought of as some kind of “rubber bands” 
holding the quarks together, would be to smear out the sharp peak at x = 1/3 and 
replace it with a probability density such that its expectation value would be 1/3: 


(alanti) z A (2.59) 


2.2.2.2 QCD effects and scaling violations 


A drastic change, however, occurs, when the emission of quanta without their imme- 
diate reabsorption is considered, along the lines of what was discussed in Section 2.1. 
As has been shown, these additional partons, which are summarily denoted as “sea” 
or “sea-partons”, would have a finite lifetime that increases with decreasing momen- 
tum fraction x. This leads to these sea partons typically having larger probability 
distributions at small rather than at large values of x. In fact, it is worth analysing 
the kinematics of parton emission as described by the QCD analogue of the splitting 
functions given in Eq. (2.33). 

They will lead to gluon emissions off valence quarks typically taking place for values 
of z close to 1, i.e. for small momentum fractions (1 — z) of the gluon with respect to 
the quark. As sea quarks as well as additional gluons emitted by these gluons inherit 
their kinematics, the perturbative production of sea partons favours them to have 
small values of x. Consequently, the combination of lifetime arguments and the form 
of the splitting functions induces non-zero sea PDFs, which typically steeply increase 
for decreasing x. Naively, and to a fairly good approximation, 


fosato ia p’) x gra , (2.60) 


where à ~ 1 for gluons and sea quarks and A ~ —1/2 for valence quarks. This behaviour 
is contrasted with the cases of no or elastic, “rubber-band”-type interactions in a 
sketchy way in Fig. 2.13. Apart from the increase for small x due to the sea partons, it is 
worth noting that also the distribution of valence partons, which for elastic interactions 
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Fig. 2.13 Sketch of the up-quark PDF for different interactions: valence 
quarks only, without and with elastic interactions and full QCD. 


is centred around x ~ 1/3, shifts towards smaller x values of about 0.1 and broadens 
out. This of course is due to them losing energy due to the emission of sea partons. 
And in fact, while the PDF -— from a CT10 NLO set [551] — displayed in Fig. 2.13 
for an u-quark is taken at relatively low scales of up = 10 GeV, this slight “valence 
bump” at x ~ 0.1 persists also to higher scales. 

This picture of secondary parton radiation indeed is very similar to the case of 
photons emitted by an electron. There, the photons play the role of the sea, allowing 
the photons to split into electron—positron pairs supplements this sea with additional 
fermions. 

Putting this together yields a picture of the proton in the z-Q? plane as sketched 
in Fig. 2.14. The behaviour of the PDFs is entirely calculable with perturbative meth- 
ods, and is in fact given by the Dokshitser—Gribov—Lipatov—Altarelli—Parisi 
(DGLAP) equation for QED, cf. Eq. (2.8), with the starting condition 


feje(z, 0) =6(1—2) and f,/-(z, 0) =0. (2.61) 


In contrast to this fully known and entirely perturbative QED case, the case of QCD is 
more complicated, owing to the essentially non-perturbative infrared structure of the 
theory. There, the perturbative regime is bound from below; typically it is assumed 
that perturbative QCD breaks down for scales of the order of 1 — 2 GeV and below. 
This implies that the starting conditions for a DGLAP evolution must be taken from 
data, a subject that will be the focus of a more detailed discussion in Chapter 6. As 
already stated in Section 2.1.3, the scale evolution of the PDFs is given by the DGLAP 
equation, Eq. (2.31), where q denotes all quark flavours q and their anti-quarks. 

A consequence of this scaling behaviour is that for increasing scales u the sea 
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Fig. 2.14 Sketch of the proton in the z-Q? plane. At large x (~ 1/3), the 
number of partons in the proton remains roughly the same, given by the 
valence partons. In contrast, at smaller x and larger Q? significant scaling 
of the parton number takes place. The “transverse size” of the resolved 
partons roughly scales with 1/Q. 


contribution is logarithmically increasing, a phenomenon known as scaling violation. 
Pictorially speaking, the probability of finding a parton within a parton increases by 
probing with larger transverse resolution power, as provided by the scale. In this 
respect, partons are not elementary objects, i.e. point-like particles. Rather, they are 
objects which carry the quantum numbers of the fundamental fields of QCD, quarks 
and gluons, but they cannot be interpreted as “naked” particles at all scales. In other 
words, partons are non-elementary particles, with a complicated internal structure, but 
they behave like fundamental quanta if probed at a low enough scale. In this respect, 
the partons are very similar to the colliding electron of the electromagnetic analogy 
in Section 2.1, which is a superposition of various Fock states involving secondary 
quanta. 


2.2.3 Partonic cross-section at leading order 


The production of a lepton-neutrino pair €~i or lve in hadron collisions provides 
a great starting point for a more in-depth discussion of particle production in such 
processes. In the Standard Model, it is related to the exchange of a single W* boson, 
with subsequent decay into a lepton and its neutrino. 

These processes are mediated by the weak interaction. Due to the underlying elec- 
troweak symmetry, the weak coupling gw can be related at LO to the electromag- 
netic coupling through 


e 


o= [4v2Gr m|? = (2.62) 


sin Ow 
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with the Weinberg angle 0w and the Fermi constant Gy. Numerically, the elec- 
tromagnetic coupling in the Thompson limit and at the Z-pole are roughly 


1 
— fe 0 
eu) J7 P? 
al) =E x (2.63) 
N 1 
T5 for u=mz, 
the sine of the Weinberg angle is about 
sin 0w ~ 0.23 (2.64) 


and the Fermi constant approximately is given by 
Gp ~ 1.166 - 1075 GeV-?. (2.65) 
2.2.3.1 Matrix element for on-shell W production 


As a first step, consider the matrix element for the production of an on-shell W+ 
boson. At leading order, i.e., in the approximation of tree amplitudes only, there is 
only one diagram, and the corresponding matrix element reads 


B uf), (2.66) 


where the spinor arguments and subscripts indicate their momenta and colour indices 
and V,,q is the relevant element of the Cabibbo—Kobayashi—Maskawa matrix. 
This amplitude leads to the summed and squared expression 


= 3 |ẹVual gF Pl eee Q Qu 
X Minw-l? = mi a H Pa BY 7 Juv + m3 
2 42 2 42 i (2.67) 
= [Vira Iw Q? 2 [Vaal Iw me 
12 12 ws 


where Q = pı + p2 and the invariant mass of the boson, § = (pı + p2)? = 2(pip2) = 
Q? = mi, have been introduced. The factor 3 stems from the sum over three possible 
quark-line colours, the 1/9 takes care of taking the average over all possible colour 
configurations of the quark and the anti-quark, and the factor 1/4 reflects the average 
over the incoming quark spins. 


2.2.3.2 Matrix elements for ud > vel 


The two diagrams displayed in Fig. 2.15 relate to the two different charge states W+ 
and W~. At leading order each of them is the only relevant one for each of the charge 
channels. The matrix element for the process ud > lve, W+ production and decay, 
reads 
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Fig. 2.15 Production of a W~ boson and its leptonic decay at leading 
perturbative order. Here u and d stand for arbitrary up- and down-type 
quarks, respectively. 
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(2.68) 


The terms in the first line correspond to the respective left-handed fermion currents, 
made manifest by the short-hand notation 


q= 
T 2.69) 


while the second line represents the W propagator connecting them. 
A similar expression can be found for the case of W~ production, by suitably 
permuting the labels of the fermion spinors. Squaring yields 
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(2.70) 


where the average over the initial quarks’ spins and colours and the sum over the 
lepton spins in the final state is implicit. Here the Mandelstam variables 


§ = Q? = (Pu + pg)” and t = (Pu — Pz)” (2.71) 


have been employed. 
Rewriting the phase-space integral over the outgoing particles as 
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where Q% is the solid angle of the outgoing lepton with respect to the incident u-type 


quark in the rest frame of the collision. In the same system, the i = —2pu -pz becomes 
Ê = —2p,,-pp COS —5(1—cos6"), (2.73) 


where 0* is the polar angle of the lepton with respect to the incoming u-type quark. 
This allows to rewrite 
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(2.74) 
and thus 
tal 1 
o nt = re r J awas 7 2 
pee 5767 [5 — my] + myy 
(2.75) 
x X` tufujh (Eus te alae, Ca HF)| 5 
u,d 
where the integral over the energy fractions has been written as 
dé 
dx,dzz = z dyw. (2.76) 


This is because the x, g can be related to the centre-of-mass energy squared and the 
centre-of-mass rapidity through 


Ly (2.77) 


In other words, the cross-section is written as an integral over the invariant mass 
squared and rapidity of the produced system. 

Going a step further allows to calculate a differential cross-section with respect to 
the lepton rapidity. The trick here is to relate its rapidity gj in the partonic c.m.- 
frame with its rapidity yz in the hadronic c.m.-frame. This is fairly straightforward, 
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since rapidities are additive along the same boost axis and therefore 
yz = ĝe + yw - (2.78) 


For massless particles, such as the lepton in the process here, also rapidities coincide 
exactly with pseudorapidities, therefore, in the partonic c.m.-frame 
0 1 1 + cos 0* 


1 =l t — l 2. 
Ye ae 2 2 ne 1 — cos 6* Co) 


or j 
inĝ* = ; 2.80 
T cosh ĝz ( ) 


This means that 
dcos 6* = sin? 0*dĝz = sin? 0” dyz. (2.81) 


Finally, therefore 


an2 MFG - - 
sin“ 0 deity, (2.82) 


do bv 
Suut J dedeaturn, (£u, LF) fa/ng (Tā, LF) dcos ĝ* 


dyz 


2.2.3.3 Narrow-width approximation 


The result in Eq. (2.75) can be further simplified by noting that the propagator — 
a Breit-Wigner form — suppresses values of ê away from my. This observation is 
manifest in a simplification known as the narrow-width approximation (NWA). 
In this approximation, the propagator factor for an internal particle X with mass Mx 
and width T x is replaced according to 


ds T 


dé 6(8 — M? 2.83 
G- M2} + Mary MxTx g ae) 


where the overall factor outside the 6-function ensures that the replacement does not 
change the value of the integral. Applying such an approximation will result in a sharp 
mass distribution of the decay products of the propagator particle. Depending on the 
actual measurement, and, of course, the values of the internal particles mass and width, 
this then may yield unphysical results. As a rule of thumb it can be argued that this 
may be the case if a measurement of kinematic observables of the decay products is 
more accurate than Ty /Mx. In NWA, 


4 2 Ymax nee ae 
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(2.84) 
where the W rapidity yw is constrained by £uxgs = mj, and therefore 


1 s 
lyw| < Ymax = 5 log ey es (2.85) 
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Quite often, the fact that this is an approximation only is further aggravated by 
also ignoring the Lorentz structures in the numerator of the propagator, thus effec- 
tively forfeiting any knowledge of eventual correlations between initial and final state 
particles. In such a case, the matrix element squared for a typical 2 —> 2 s-channel 
process ab + X —> cd with an intermediate particle X (like the one studied here, 
W-production and decay) becomes proportional to the respective branching ratios: 


IM|20+x+ca X BRx ab BRX>ca. (2.86) 


2.2.3.4 W forward-backward asymmetry 


Alternatively one could ignore the decay of the intermediate particle and focus on its 
on-shell production. Employing this approximation for the production of the W boson 
for the moment, it is possible to discuss the on-shell production of a W boson. Its 
matrix element squared at leading order is given by 


2 Va Zid 
Mg ye e e rie (2.87) 


resulting in the parton-level cross-section 
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where in the last line the implicit integration over § with has been used together 
with the 6 function and where gw = e/sin@w has been employed. The production 
cross-section in hadronic collisions thus becomes 
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(2.89) 
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where E = ys is the hadronic centre-of-mass energy and the limits of the rapidity inte- 
gral ymax = 5 log oor follow from momentum conservation, § = 7,075 = my and 
W 


the relation of the «1,2 with the rapidity of the produced system Yem., cf. Eqs. (2.76) 
and (2.77). For future reference it is also useful to introduce 


2 2 
(Lo) _ Tgw |Vual 
so that 
Ymax 
LO LO 
eee = o (s) f dyw D fum (Zu, UF) fajra (xg, HF). (2.91) 


Zmak u,d 

The result above, Eq. (2.91), implies that the rapidity distribution of the W boson 
is entirely defined by the PDFs, and, at leading order, by the quark PDFs only. While 
the sea is more or less flavour symmetric, the valence contribution is not. In protons 
(anti-protons) there are twice as many valence u quarks (ù anti-quarks) than d quarks 
(d anti-quarks). This has the following implications: in proton—anti-proton collisions, 
like at the Fermilab TEVATRON, both species of W bosons, Wt and W~ bosons, have 
exactly the same production cross-section. However, Wt bosons tend to fly more likely 
in the direction of the protons, the forward direction, while the W~ bosons tend to 
follow more likely the direction of the anti-protons, the backward direction. This is 
because they are more likely to obtain a strong “kick” into the respective direction by 
an up-type rather than a down-type valence quark. In proton—proton collisions like at 
the CERN LHC, this picture does not hold true any longer. There, the cross-sections 
for Wt production is larger than the cross-section for W~ production — in fact, if in 
both cases a valence quark would have to be involved, the would differ by a factor of 
two. This is not true, since there is a sizable contribution from the other sea quarks 
and, at higher orders, from incident gluons. Also, their rapidity distributions are not a 
reflection of each other around central rapidity any longer, but each of them of course 
is symmetric under reflections around y = 0. In addition, the W* bosons tend to have 
a slightly larger rapidity, due to the higher probability to obtain a strong “kick” from 
an incident valence quark, while the W~ bosons are more central. This behaviour 
and the comparison of it at the TEVATRON and the LHC at different c.m.-energies is 
exhibited in Fig. 2.16. It is worth noting that the shape at symmetric proton—proton 
collisions also changes quite dramatically, as the c.m.-energy of the colliding hadrons 
increases from 8 to 100 TeV. This is due to the fact that with increasing energies the 
x the quarks need decreases, leading to the contributions of the sea-quarks becoming 
larger and larger. The valence quarks, when annihilating anti-quarks from the sea, lead 
to a pronounced boost of the W boson, and, thus, a depletion in the region of central 
rapidities. This region is being filled by the more symmetric annihilation of pairs of 
sea partons. At 8 and 14 TeV this leads to the plateau-shape of the distribution. This 
plateau of course widens in rapidity with the energy of the colliding hadrons. At 100 
TeV, however, the sea contribution takes over shaping a mount at central rapidities. 

This behaviour is completely driven by the PDFs and the interplay of valence and 
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Fig. 2.16 Rapidity distributions of W~ bosons at the TEVATRON, at the 
LHC at c.m.-energies of 8 and 14 TeV, and at a future hadron collider with, 
a c.m.-energy of 100 TeV. The calculation has been performed at leading 
order with the CT10 PDF [713]. 


sea contributions at high and low values of x, respectively. In fact, measurements of 
the W*-W~— asymmetries are a very useful way of constraining high-x valence quark 
PDFs [551]. However, the simple picture outlined above is not easy to translate into a 
measurement, the problem of course being that the W bosons decay into a lepton and 
an invisible neutrino. The latter makes it very hard to reconstruct the W kinematics, 
since neutrinos manifest themselves as missing transverse energy at hadron colliders, 
which leaves the charged lepton only. The corresponding lepton asymmetry thus 


reads 
(Z) _ (Z 
dye+ dye- 
= 2.92 
A (2) 7 (#2) oe) 
dye+ dye- 
where the ye+ are the rapidities of the positively and negatively charged leptons. 


This complication gives rise to a subtle effect impacting on the measurement. 
Analysing the interplay of momentum transfer and parton centre-of-mass energy in 
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Eq. (2.73), which defines the form of the differential cross-section, it becomes clear 
that the positively charged leptons prefer to travel anti-parallel to the incident up- 
type quark. At the TEVATRON, this means that positively charged leptons preferably 
move in the direction of the anti-proton, and the negatively charged leptons prefer- 
ably move in the direction of the proton, i.e., in both cases, the leptons tend to move 
against the direction of the W boson they come from, partially compensating their 
initial boost. 

In proton-proton collisions, the picture then looks a bit more confusing. Naively, 
one would expect to have more Wt than W- bosons, and typically they would stretch 
out to larger rapidities, leading to the asymmetry always being positive and actually 
increasing with increasing rapidity. However, the W bosons have a maximal rapidity, 
about Ymax % 4.5 at a 8 TeV LHC, while the leptons, being effectively massless, can 
reach rapidities well beyond this point. This implies that at some point the typical 
relative rapidity the leptons have with respect to the original W boson becomes the 
dominating factor, and, as discussed, this is where the negatively charged leptons 
will become more abundant than the positively charged ones. This means that the 
asymmetry will turn negative. The fact that most of the bosons are not at maximal 
rapidity (which is possible only if one of the x, typically x, equals 1) and the fact 
that the positively charged leptons are typically oriented against the direction of the 
W~ boson, translate into this point being significantly more central than the maximal 
rapidity available to the W bosons. This behaviour is exhibited in Fig. 2.17, where the 
lepton asymmetry at leading order is displayed. 


2.2.4 W + jet production at leading order 


In this section, the emission of an additional parton will be interpreted as the produc- 
tion of the vector boson in association with a jet. The structure of this process will be 
analysed, with special emphasis on the collinear and soft limits of the matrix element. 


2.2.4.1 Structure of the matrix elements 


Consider real corrections to a given process, for example the real corrections to W 
production at hadron colliders. Relevant diagrams are exemplified in Fig. 2.18. If the 
additional parton is energetic enough, it will leave a trace in the detector by depositing 
some hadronic energy. If this deposit is well-separated enough, it is typically interpreted 
as an extra jet. Of course, the cross-section for such a process depends strongly on the 
separation criterion; here just a minimal transverse momentum of the extra parton 
will be demanded. By virtue of momentum conservation this will then immediately 
lead to the W boson recoiling against this jet, i.e., acquiring transverse momentum as 
well. 

Looking at them, it is important to stress that the Feynman diagrams are nothing 
but pictorial representations of quantum mechanical transition amplitudes, interfering 
with each other. At the same time, their quantum nature renders any question like 
“which of the two incident quarks emitted the outgoing gluon?” completely unphysical 
and meaningless. Such questions represent a futile and uncomprehending attempt to 
transport classical concepts to the quantum world. They are fully equivalent to the 
question of which slit the electron passed through in the double-slit experiment. 
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Fig. 2.17 The lepton asymmetry, defined in Eq. (2.92), at the TEVATRON, 
at the LHC at c.m.-energies of 8 and 14 TeV, and at a future hadron collider 
with a c.m.-energy of 100 TeV. The calculation has been performed at 
leading order with the CT10 PDF [713]. 


With this in mind, the real contributions can be decomposed into three sets of 
diagrams each with two interfering amplitudes: one set where a gluon is emitted into 
the final state, i.e. the sub-process ud > gWt[-> vf], and two sub-processes where 
the initial gluon gives rise to either a down-type quark or an up-type anti-quark, 
ug > dWt[> vel] and dg + uW+[- vel], respectively. These three sets of sub- 
processes do not interfere, since their initial and their final states are composed of 
particles that, in principle, could possibly be distinguished. 

Ignoring, for simplicity, the decay of the W-boson, the resulting amplitudes are 
given by 


tgsgwVud — a Pa- Pe 
Mad = —=— Udi 
ae cia V2 as cr — Pg (pq — Py)? "7 (2 93) 
Pu p a v,a l 
tup pF” Uu, jeW Eg 
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Fig. 2.18 Real contributions to the NLO correction of the production and 


leptonic decay of a Wt boson. Here u and d stand for arbitrary up- and 
down-type quarks, respectively. 
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Here, all colour indices and structures have been made explicit, by adding the funda- 
mental (or triplet) colour indices i and j of the quarks and the adjoint (or octet) colour 
index a of the gluon as well as the colour matrix Tj, appearing in the quark—quark—- 
gluon vertex. Looking at these expressions, it becomes apparent that the diagrams 
contain potentially divergent structures. The divergences emerge in those cases, where 
the additional parton in the final state becomes soft or collinear. The propagator of 
the intermediate parton line reads 
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1 1 
= 2.96 
(Pq — Pg)” 2E,E,4(1— cos 0) ’ ( ) 


which diverges for the energy of the outgoing parton, Eout, or its opening angle 0 
approaching 0. This type of divergent structure is also known as an infrared diver- 
gence. 


2.2.4.2 Matrix elements squared 


After squaring and averaging /summing over initial/final-state polarizations and colours 
and some colour algebra, the squared matrix elements are given by 


A C 2 Va 2 f2 ~2 2 2. 
Mega ee (2.97) 
ud—gW 12 ta 


and 


_ AnasTR Gey |Vual? 8? + G? + 2m2,t 


IM, aw+ = IM|5,aw+ = 12 (2.98) 


—ŝû 
In all cases Mandelstam variables have been used, namely 

8 = (pa + P)? = (p1 + 2)”, 

i = (pa — p1}? = (Pe — m2)”, (2.99) 
(Pa — Po)” = (pe — p1)” , 


where the incoming partons are labelled with a and b, and the outgoing particles are 
labelled with 1 and 2. These variables satisfy 


8+i+a=mtmpetmi+m, (2.100) 


the sum of the squares of the masses of the external particles. 

Closer inspection reveals that the squared matrix elements above can be written as 
the leading-order matrix element squared times a QCD emission term, which consists of 
the strong coupling and a colour factor times an expression representing the kinematics 
of the extra emission: 


(LO) |2 _ 72 n2 24 

2 _ |M pee t +â + 2mys 

Mewe = ae ey E 

(LO) |2 22 -2 27 

2 2 |M ud>W+ 8° + 0 + 2miyt 

IMlugsaw+ = IM|5,aw+ z We - (4rasTR) a’ | 
(2.101) 


2.2.4.3 Soft and collinear limits 


It becomes apparent that the result of the processes with gluon emission into the final 
state diverge, if either f — 0 or & > 0. This is the case, when the gluon is either parallel 
with one of the incoming particles, the collinear divergence, or if its energy vanishes, 
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the soft divergence, in wich case both ¢ and é go to zero. The divergent regions of the 
gluon emission phase space can be avoided by suitable cuts. A convenient choice here 
is to demand that the gluon has a minimal transverse momentum, which would be 
interpreted as the minimal transverse momentum of the corresponding jet. 

Similar reasoning also applies to the case where the gluon appears in the initial 
state, which exhibits a divergence with û — 0. There the case § — 0 is prohibited by 
the finite and sufficiently large mass of the W boson. This also prohibits the limit of the 
gluon energy going to 0, with the result that this process is less divergent than the one 
with the gluon in the final state. However, its remaining, purely collinear divergence, 
can also be avoided by a cut on the transverse momentum of the extra parton. 
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Fig. 2.19 The differential cross-section for WT +j production with respect 
to the transverse momentum of the Wt boson at the TEVATRON (left) 
and the LHC (right) at c.m.-energies of 1.96 TeV and 8 TeV, respectively. 
The error bands are obtained from a variation of the factorization and 
renormalization scales by a factor of two. 


Assuming massless incoming and outgoing partons, the outgoing momenta can be 
written as 


Pw = (miw cosh yw, pı cos ¢, pı sind, miw sinh yw) 


E , i (2.102) 
Pig = (P1 cosh Yq, g, —p1 cos Q, —p1 sing, pi sinh yq,g) , 


where p, is the transverse momentum of the jet and, by momentum conservation, of 
the W boson, and m yw is its transverse mass given by 


miw = ymy + Diy = ymy to. (2.103) 


The incoming momenta read 


Ph = ee (1, 0, 0, +1) (2.104) 
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and therefore, with x, = pi //s 
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The kinematic part of the gluon-emission matrix element squared thus becomes 
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where zm = mw /vys. This shows that for p? — 0 the matrix element diverges like 
2m?y /p7 , a logarithmic divergence. This becomes apparent in Fig. 2.19, where the p1 
distribution of the W+ bosons in the hadronic production of the bosons in association 
with a single jet at the TEVATRON and the LHC, calculated at leading order is displayed. 

To actually properly calculate differential cross-sections like the one depicted in 
Fig. 2.19, the phase-space element has to be determined in useful quantities such as 
the rapidity of the gauge boson and its transverse momentum. To this end, the phase- 
space element for the outgoing particles can be written as 


A e AE as — q) (2n)8(p2y — m3,) (2m)8(q2) 
(2m) r) oe . on 
mi wdm i wdywd? TEE ee 
= Mand iw O ty Q7 — iy) 8+ 6+ amy) 
= cow ee lE + i+ û my). (2.107) 


In the transformation of the ô functions, the fact has been used that 
5(q?) = 5((putpa—pw)”) = 6(8+miy —2(putpa)-pw) = 6(8+t+a—miy) (2.108) 


encodes the Mandelstam identity. 

Using this form of the phase space element, the fixed order cross-section for the 
production of a W* boson in conjunction with a gluon in the final state can therefore 
be written as 
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with the parton luminosity given by 
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Ignoring all information concerning the additional parton by basically integrating over 
its phase space, this can now be rewritten as the double differential cross-section for 
the production of a gauge boson, 
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Here, the relations 
bi LO 1 Tg |Vual? 
Qi = eo Cuaows (8) = s ao (2.112) 


cf. Eq. (2.90) for the latter, have been used to rewrite the matrix element squared in 
Eq. (2.97) as 


(LO) BTS AsCr i +0? +4 2Wm2.,8 
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The lower limits of the integration over the Bjorken parameters x 4 and zp in Eq. (2.111), 
Za, and Zp are fixed by the dynamics of the W boson: 


m% = Žąřgs and 4p = ee (2.114) 
Taking a closer look at Eq. (2.111) exhibits the divergent structures related to the 
emission of a gluon. First of all, there is the term 1/Q?, giving rise to a logarithmic 
divergence of the form dQ? /Q%. In addition, and less obvious, there is the implicit 
dependence on the Bjorken parameters x4 and xz of all kinematic quantities, which 
will lead to further divergences, which are partially related to the evolution of the 
PDFs. They will be treated in the next section, by identifying and suitably absorbing 
them. 
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2.2.5 Partonic cross-section at next-to leading order 


In order to achieve higher precision for the theoretical calculation, the evaluation 
of next-to-leading order (NLO) or next-to-next-to-leading order (NNLO) 
contributions or even beyond, is mandatory. Here, the focus will be on the discussion 
of the first perturbative correction, called the NLO correction. Looking at the Feynman 
diagrams of the example process, Fig. 2.15, higher-order corrections will involve higher 
orders in either the electroweak or the strong coupling constant. At hadron colliders, 
however, it it usually the latter, the QCD correction, which turns out to be significantly 
larger than the electroweak correction. 

Typically, such an NLO QCD correction is related to either the emission of an 
additional leg, i.e., an extra parton, into the final state (real correction) or to the 
emission and reabsorption of a parton, through a loop (virtual correction). The 
former actually has already been discussed to some extent in Section 2.2.4. 


2.2.5.1 Divergent structures in the matrix elements 


Similar to the divergences already encountered in the case of the real emission diagrams 
before, divergences also appear in the virtual correction, which is due to Feynman 
diagrams involving closed loops. Here, however, the divergences correspond to two 
cases: 


e Infrared divergences may show up when the momentum squared of one of the 
propagators approaches zero. In other words, when a particle in the loop goes on 
its mass shell. 

e Ultraviolet divergences may emerge in the limit k — oo, where the momen- 
tum running in the loop becomes infinite. In this case, terms like d4k/k” naively 
diverge, if n < 4. 

In all cases, dimensional regularization is the method of choice when such 
divergent integrals need to be evaluated. In this method, the originally 4-dimensional 
phase space of a single particle, is replaced with a D-dimensional expression, 


(27)6(p* — m?)@(E) — ay (mye? —m?*)@(E), (2.115) 
and the divergences manifest themselves as poles in 2/(D — 4) = 1/e. 

Concentrating on the case of infrared divergences, the complication here is that, 
for a process with N outgoing particles at the Born level, these integrals are over 
the real correction phase space with (N +1) particles and over the loop momentum 
in the N-body virtual correction term. For simple processes such as the example of 
inclusive W-production, these integrals are fairly straightforward to calculate directly, 
leading to the desired cancellation of infrared divergences as described by the BN and 
KLN theorems [257, 678, 724]. This direct approach, however, becomes increasingly 
complicated and forbids itself, if the phase-space integration is too complicated for 
direct analytical evaluation. In such a case, if the results of the phase space integration 
can only be obtained numerically, other methods have to be invoked. In the next 
chapter, this will be highlighted with a toy model, leading to a master formula for 
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the calculation of cross-sections at next-to-leading order. In a first step, however, the 
structure of such a calculation in its analytic form will be presented. 

In contrast, the ultraviolet divergences typically do not cancel each other, and 
they reside in the loop contributions only. Naively, whenever such a loop contribution 
is encountered, an ultraviolet divergence may not be far away. In such a case, the 
programme of regularization and renormalization of the diagrams and the theory, 
respectively, needs to be invoked, which, however, is a standard topic across textbooks 
on quantum field theory. In particular, in the case of non-Abelian gauge theories, 
renormalization becomes a somewhat tedious exercise due to many identities stemming 
from the gauge symmetry relating the different contributions to all orders. 


2.2.5.2 Matrix elements for virtual corrections 


The real emission contributions have already been discussed in the previous section, 
Section 2.2.4, where this part of the next-to-leading order calculation has been inter- 
preted as a new leading order process. Here, the results from Section 2.2.4 will be used 
in a slightly different way, namely as part of the higher-order correction to the inclu- 
sive vector boson production. This implies that, at first perturbative order in the 
strong coupling, the emission of an extra jet is a part of the inclusive cross-section. 
But while this calculation allows for a next-to-leading order description of the inclusive 
production properties with correspondingly improved perturbative uncertainties, the 
exclusive vector boson plus jet production part of the calculation still is at the 
leading-order accuracy. 

In the case of W* production the Feynman diagram related to the loop correction 
is depicted in Fig. 2.20. 


u Vg 


wr 


d g+ 
Fig. 2.20 Vertex correction contributing to the NLO correction of the 
production and leptonic decay of a W* boson. Here u and d stand for 
arbitrary up- and down-type quarks. 


For the virtual contribution, the amplitude reads 
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where D = 4 — 2e. 
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Closer inspection reveals that the ultraviolet divergences cancel exactly with the 
renormalization of the external particles, essentially obtained from self-energy dia- 
grams. This ultraviolet cancellation presents testimony to the fact that the ud-current 
is conserved. Therefore, the only remaining divergences are infrared, and the result for 
the virtual matrix element multiplying the Born-level matrix element reads !° 


ud>Wwt ud>Wwt 
2\€& (2.117) 
(0) 2 as H 2 3 2 
qo ws on Cr (=) cr (-2 ere nae ` 


with Q? = (pu+pa)? = miy and cr = 47€ /T(1—e). For a detailed discussion of how this 
result may be obtained, the reader is referred to the following chapter, Section 3.3.1. 


2.2.5.3 Next-to-leading order: Real corrections 


For the real correction diagrams, the result emerges from a D-dimensional integration 
over the phase space of the emitted parton, as discussed in the previous section. In 
the case of the additional gluon in the final state, with its momentum denoted by k, 
this yields (again, cf. Section 3.3.2 for details of this calculation) 
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The divergences in the first line of the result for this real correction exactly cancel 
the divergences in the virtual result. The additional divergence in front of the splitting 
function PD is in fact universal, i.e., process-independent. This process-independent 
term can thus be absorbed into the renormalization of the PDF, see Section 3.3. This, 
however, is a scheme-dependent procedure. In total then, a finite result is obtained. 

The method of directly calculating the phase space of the real emission part in 
D dimensions followed here of course becomes prohibitively complicated for processes 
with an increasing number of external particles. For such situations better algorithms 
have been developed, among them what is by now known as infrared subtraction 
algorithms such as Catani-Seymour dipole subtraction [344, 353] or the method 
by Frixione, Kunszt, and Signer [539, 542]. The former will be further discussed in 
Section 3.3.2. 


15More correctly, this is the result obtained in conventional dimensional regularization, a regular- 
ization scheme that is defined in Section 3.3.1. 
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2.2.5.4 Next-to-leading order: Final results 


Using the techniques briefly outlined in the previous section, and further worked out 
in detail in the following chapter, the total partonic cross-section for the ud > Wt 
production cross-section at NLO reads 


(NLO) _ «(LO) _ Qs(UR) An? 
Gg api Z Cg fı E Cr ( 3 8} d(1—2z) (2.119) 
4 (1-2)? a-z)? PRC), we 
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Note that the result still needs to be convoluted with the PDFs. The last term in 
the square bracket above, proportional to the splitting function and containing a term 
of log(u4,/Q*), ensures that there is a compensating factor if the PDFs are evaluated 
away from their “natural” scale (if u? 4 Q?); for the process at hand it is the only 
scale in the process, Q?. This corresponds to the “running” of the PDF, mediated by 
the corresponding splitting function and resluting in the logarithm of the scale ratio. If 
the process at leading order was already containing n factors of as, a similar running 
of the strong coupling, mediated by n first order terms of the 6-function would appear 
as well, this time of course with logarithms of the ratio of u3 and Q?. 

Another contribution, stemming from the gluon initial states, 


a (NLO) — (LO) 
ug=>dWt ~ Oud->wt 
este) y, | pa) (1-2) Mp \ 1 | 
zn TR Pg (z) | log 5 ee koU z)(1+7z)} , (2.120) 


needs to be added, which of course starts at O (as). 


2.2.6 Scales 
2.2.6.1 Sketching the issue 


In order to estimate theoretical uncertainties, it has become customary to vary the 
renormalization and factorization scales by a factor up or down. Typically this 
factor is chosen to be two, and in most cases both scales are changed in parallel, i.e., 
both are at the same time multiplied with either 2 or 1/2. There is some dispute on 
whether 2 is a sufficiently large factor to catch all uncertainties and on whether or not 
the scales are indeed chosen in parallel. There is, however, an additional caveat, which 
is more related to the actual choice of scale. Usually, the default scale u = ur = 
uR is determined by identifying it as the “characteristic scale” of the process under 
consideration, either given by some intermediate particle’s mass or some function of 
the final-state momenta. 

In such an arrangement, and at leading order, the factorization scale is typically 
interpreted as the scale up to which softer partons and, correspondingly, the emission 
of additional partons are ignored in the actual matrix element. Such emissions are 
rather subjected to a more inclusive treatment in the evolution of the parton content 
of the hadrons from hadronic scales to the actual harder, i.e. perturbative, scale of the 
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process. When considering more complicated processes, this simple picture is likely to 
change drastically. As an example, consider the case of W production in conjunction 
with additional partons, defined as jets at leading order. Very broadly speaking, one 
may distinguish two extreme cases of how the kinematics of this process works out. 
There will be a region, where the W boson is accompanied by softer partons; here, 
the parton emission can be thought of as QCD correction to an electroweak process. 
This is a case, where one may still think of using my or similar as the relevant scale 
for wr, and, maybe, even for ur. On the other hand, there will also be a region in 
phase space, where the partons are produced at transverse momenta, which are much 
larger than the W mass. In such a configuration, one may interpret the W emission 
as a weak correction to jet production, intrinsically a QCD process. Here, my, pY, 
or similar as the scale characterising the process would not be a great choice. In fact, 
recent work [225] suggests that choices that interpolate between these two regimes, 
such as Hr/2 or so are much better suited. Hr is defined as the scalar sum of the 
transverse momenta of all jets, leptons, and the missing transverse energy, 


Hr= X pigt+ do piat i. (2.121) 


jEjets lee 


What is clear, though, is that the scale dependence essentially stems from an 
inability to calculate cross-sections correctly, i.e., to all orders. Instead, in all calcula- 
tions, the perturbative series is truncated at some fixed order, which leaves a residual 
logarithmic dependence on the renormalization and factorization scales. With the per- 
turbative series thought to be asymptotic, there is some assertion that this dependence 
would diminish with each additional order in the calculation. While this seems to be 
by far and large correct, cases like W +3 jet production discussed in [225] indicate that 
with apparently bad choices of scale, pW in this case, this decreasing dependence is not 
always realized at next-to-leading order. However, apart from such pathological cases, 
higher-order calculations indeed always diminish the scale dependence. It is fair to 
state that, in order to have any reasonable and reliable estimate of the corresponding 
scale uncertainty, the inclusion of at least one additional perturbative order, i.e., a 
next-to-leading order calculation, is mandatory. 


2.2.6.2 A quantitative example 


In order to see more quantitatively how higher-order calculations reduce the artificial 
scale uncertainty due to the truncation of the perturbative series, consider an example 
worked out in [585], namely the single-jet-inclusive p; -distribution at the TEVATRON. 
At leading order, this distribution can be related to diagrams such as the ones shown 
in Fig. 2.21. At large transverse momentum — and therefore at large centre-of-mass 
energies and consequently at large parton « — the dominant contributions stem from 
quark—anti-quark initial states that can be seen as initial valence quarks. The lowest 
order differential hadronic cross-section is given by 


da (9) 


es = fasp(ur)faplur) ® a8 (HR) 6) (2.122) 
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Fig. 2.21 Some leading-order diagrams for inclusive jet production from 

a quark—anti-quark initial state. 
where 6°) represents the lowest order partonic cross-section. Including the next-to- 
leading order corrections, this can be written as 


da (NEO) 
dp, T fa/p (Mr) fajal Ur) 
(2.123) 
a? (ur)ê® + a3 (UR) (a + 2bo log PR 5 (0) _ 2P qq log araw) ; 
PL PL 


where logarithms that explicitly involve the renormalization or factorization scales 
have been exposed. The remainder of the O (a?) corrections are all incorporated in 
the function 6“), 

From this expression, the sensitivity of the distribution to the renormalization scale 
is easily calculated using the expression for the running of the strong coupling, 


Jas (ur) 


e S 2 _ 3 4 
B(log un) ~ 000s HR) — bras (un) + O (04), (2.124) 


cf. Eq. (2.18), where the two leading coefficients in the 8-function, bo and b1, are given 
in Eq. (2.20). Looking at the terms in Eq. (2.123) it becomes apparent that the first 
term in the square bracket, the leading-order contribution, and the term proportional 
to 2bo in the round bracket cancel out, such that the remaining dependence on upr is 
contained in terms O (aé). 

In a similar way, the factorization scale dependence can be calculated using the 
non-singlet DGLAP equation 


finalur) _ as(ur) 
O(logur) — 2r 


This time, the partial derivative of each parton distribution function, multiplied by 
the first term in Eq. (2.123), cancels with the final term. Thus, once again, the only 
remaining terms are O (a2). 

This is a generic feature of a next-to-leading order calculation. Any observable pre- 
dicted to O (a7) is independent of the choice of either renormalization or factorization 
scale, up to the next higher order in the strong coupling, O (agtt). Of course, this 
is only a formal statement and the numerical importance of such higher-order terms 


Pj 8 fiyn(ur), (2.125) 
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Fig. 2.22 The single-jet-inclusive distribution at p} = 100 GeV, appro- 
priate for Run I of the TEVATRON. Theoretical predictions are shown at 
LO (dashed blue) and at NLO (solid red). 


may be large. As a concrete example, consider jet production at the TEVATRON Run I. 
In that case the numerical values of the partonic cross-sections entering Eq. (2.123) 
are 6) = 24.4 and 6“) = 101.5. Equipped with these values the LO and NLO scale 
dependence can be calculated, as shown in Fig. 2.22, adapted from Ref. [585]. In this 
case the factorization scale has been kept fixed at ur = p, and only the dependence 
on the renormalization scale is shown. The figure indicates the expected result, which 
is that the renormalization scale dependence is reduced over a wide range of values for 
ur when going from LO to NLO. 

While the reasoning above, culminating in Fig. 2.22, is a fairly accurate represen- 
tative of the situation found at NLO, the exact details depend upon the kinematics 
of the process under study and on choices such as the running of ag and the PDFs 
used. It is worth noting, though, that due to the actual structure of NLO corrections, 
as exemplified in Eq. (2.123), there will normally be a peak in the NLO curve, around 
which the scale dependence is minimized. The scale at which this peak occurs is often 
favoured as a choice specific for the process, its kinematics, and additional cuts such 
as the jet definition. For example, for inclusive jet production at the TEVATRON, using 
a cone size of R = 0.7, a central scale of 


up = un = p%/2 (2.126) 


is usually chosen. This is near the peak of the NLO cross-section for a large set of dif- 
ferent observables, cf. the specific case shown in Fig. 2.22. Adjusting the scale choice to 
be in the region of the peak is often referred to as the “principle of minimal sensitiv- 
ity” [864]. It is worth keeping in mind that such a choice is also usually near the scale 
at which the LO and NLO curves cross, i.e. for the value of the scale where the NLO 
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Fig. 2.23 The scale dependence of the cross-section for Wbb production 
at the 14 TeV LHC (left). A real radiation diagram leading to the large 
corrections is also shown (right). 


corrections do not significantly change the LO cross-section. Setting the scale by this 
means assumes “fastest apparent convergence” (FAC) of the perturbative series [600]. 

Finally, a rather different motivation comes from the consideration of a “physical” 
scale for the process. As examples take p for the inclusive jet production process or 
the W mass in the case of inclusive W production. These typical methods for choosing 
the scale do in general not agree, leading to somewhat different results in dependence 
on the actual choice and thus a corresponding theoretical uncertainty. If, on the other 
hand, the scales or the respective results do agree — quite often this seems fairly 
accidental at first sight — this may be viewed as a sign for the perturbative expansion 
to be very well-behaved. 

A word of caution is due at this point. Although the improved scale dependence 
sketched out here is typical, it is worth reiterating that this is by no means guaranteed. 
In addition to effects due to apparently bad scale choices which do not appreciate the 
full intricacy of the kinematics, like the case hinted at above, there is also another 
source of potential pitfalls. They are related to cross sections, in particular at the LHC, 
which at leading order are driven by quark—anti-quark initial states. Since the LHC 
produces an abundance of gluons, real radiation diagrams containing one or maybe 
even two gluons in the initial state can give rise to very large NLO corrections. Since 
they enter for the first time at NLO, strictly speaking they are some kind of additional 
leading-order contribution appearing at higher orders. As such they typically also 
give rise to a sizable additional scale dependence. A well-known example is shown in 
Fig. 2.23 for the case of Wbb production at the 14 TeV LHC. In the absence of gluons 
the NLO calculation has the canonical behaviour; in their presence the rate is not 
well-controlled due to diagrams such as the one shown in the same figure. 


2.2.7 Other considerations 
2.2.7.1 Perturbative orders 


There is a somewhat tricky point in the discussion of perturbative orders. As a simple 
example, consider the case of the p] -distribution of a W boson produced in hadronic 
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collisions. At LO in collinear factorization, there is no distribution and pW? = 0, since 
there is no final state parton to compensate the recoil introduced by a finite pı. So, 
strictly speaking, the p,—distribution of the W boson at hadron colliders is, at leading 
order, an observable at O(a). This, however, is already the next-to-leading order 
level for the total cross-section. Not surprisingly, then, the theoretical uncertainty, 
i.e., the scale uncertainty, of the total cross-section is smaller than the induced shape 
uncertainty due to scale variations of the p,—distribution of the W. To make things 
even more confusing, note that, in contrast, the rapidity distribution of the W is 
O (al = 1) at leading order, since it is due to the PDFs which are of course present 
at leading order. At the same time, the production cross-section for a Higgs boson in 
gluon, gg > H, is O (a2) already at leading order, because the coupling of the Higgs 
boson to the gluons proceeds through a quark loop. Even integrating out the heavy 
quark does not change this picture - the emerging effective vertex is also proportional 
to as. In summary, this means that the correct assignment of the perturbative order 
as LO, NLO, and so on, is not a fact of merely counting orders of as. Instead it is a 
process- and, even more confusing, observable-dependent characterization. 


2.2.7.2 Total cross-sections and K-factors 


Another important point here is related to what is called a K-factor, which usually 
denotes the ratio of a higher-order result for a total cross-section and the leading order 
one: 


N)NLO 
gono _ Fro (X) (2.127) 
X otee (X) 


with X denoting a specific final state. Such a K factor is proportional to powers 
of as/(27) times factors which are usually of the order of 1. A prime example is 
the hadronic cross-section in electron-positron annihilations, related at leading order 
to the process ete~ — qq. Higher-order corrections now introduce either additional 
loops, without changing the final state, or the emission of further partons, for instance 
ete — qqg. The corresponding K factor is given by 

KNNLO oNNLO(e+e- — hadrons) 


YNI = 
ene mehaarons ok (ete- — hadrons) 
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At hadron colliders the situation is not as straightforward. There, the inclusion of 
higher-order corrections often allows other partonic channels to open up, which may 
lead to significant higher-order corrections. As an example in the spirit of ete7 > 
hadrons, consider the “inverse” process at hadron colliders, namely the production of 
lepton pairs. At leading order, the parton-level process is gg > ¢*+¢—. Higher orders 
now introduce, as before, additional partons in the final state or loops. There is one 
difference, however, namely the opening of channels with a gluon in the initial state, 


for instance qg > ql". Such processes may yield a significant contribution, because, 
dependent on the phase space taken by the lepton pair invariant mass, the gluon 


(2.128) 
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PDF may become significantly larger than the corresponding quark PDFs. In the case 
discussed here, this leads to NLO K-factors of the order of K}“° ~ 1.3 compared to 
the significantly smaller KNO kadron, ~ 1:03. 

This example is just the very innocent tip of an iceberg of a number of other 
processes, where the opening of additional channels leads to K factors which are 
drastically different from one. 

Nevertheless, there is yet another class of processes which obtain large NLO cor- 
rections, even without opening additional channels. The prime example for this is the 
gluon-induced Higgs production, gg + H, where the NLO K-factor is about a factor 
of two. The origin of the large K-factor is explained by two effects, that can be seen 
by consideration of the virtual amplitude for this process. The one-loop amplitude for 
gg — H, working in the large top-mass effective theory is, 


sye 2 ll 
M pM ah X so (4) «| aS +z] l (2.129) 
where the dimensional reduction scheme has been employed. Due to the analytic con- 
tinuation that must be performed in order to evaluate the virtual amplitude for time- 
like momentum transfer Q?, this formula contains a factor of —1?/2 for every factor of 
1/e? present. This expression is to be contrasted with the relevant part of the virtual 
amplitude for W production in the same scheme given in Eq. (3.64), 


(1) 0) RNS 2 3 
M = M x cr ( ) | a T+]. (2.130) 
TT E E 


In the formula above the non-pole term is (t? — 7), with the 7? factor resulting 
from the analytic continuation mostly cancelled by the numerical constant. However 
in Eq. (2.129) there is no such cancellation and the factor of 7? remains an important 
contribution. Moreover, as can be seen from these two formulae, this effect is amplified 
by the overall colour factor of C'4 for the Higgs case compared to Cp for Drell-Yan. 
Taken together, this explains why the K-factor for Higgs production is much larger 
than for the Drell-Yan process. Since the m? terms are so important numerically, 
renormalization group techniques have recently been used to resum them to all orders 
and thus provide improved predictions for the Higgs boson cross-section [130]. 


2.2.7.3 Differential cross-sections at fixed order and giant K factors 


Another caveat arises when considering differential distributions. Often, either due to 
the limitations of the calculation or because of specific cuts that are applied, some 
distributions have a kinematic limit at LO. Adding extra radiation at higher orders, 
and in the context of the discussion here, at NLO will frequently extend hitherto 
constrained kinematic ranges. 

An example of such a situation is shown in Fig. 2.24, which depicts the transverse 
momentum distribution of a Wt boson at the 7 TeV Luc. At leading order, it is 
computed from process pp — W* + 1 jet with a cut on the jet pr at 25 GeV which 
immediately translates into the same cut on the W* boson’s pr. At NLO there is 
another parton in the final state of the real emission contribution, with an arbitrary 
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transverse momentum. This additional parton, together with the original one, can 
produce a summed transverse momentum below the cut, and therefore the W* boson 
recoiling against the partonic system will populate the transverse momentum region 
below the 25GeV cut. This has two consequences. Since the region below 25 GeV 
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Fig. 2.24 The transverse momentum of a Wt boson produced at the 
7 TeV LHC, computed at LO and NLO in QCD, recoiling against a hard 
jet with pr > 25GeV. The discontinuity in the NLO prediction around 
25 GeV is caused by the opening of the kinematic region pr < 25 GeV at 
that order. 


originates solely from real radiation events, it should only be trusted as much as a 
LO calculation. In addition, the region around the kinematic boundary at 25 GeV is 
not well-described. The values of the histogram bins there are simply artifacts of the 
calculation and are not reliable. Smoothing out this sort of problem, and providing 
hadron-level predictions that can be directly compared with experimental results, is 
the domain of the parton shower and resummation. 

Another example for this has been discussed in [830], where the term “giant K- 
factors” has also been coined. Following previous work, in particular [200, 205, 321, 521] 
for the case of vector bosons accompanied by jets, but also, for the case of vector boson 
pairs, [252, 311], it has been observed that there are substantial NLO corrections for 
observables at large scales of the order of the vector boson masses and beyond, which 


W boson production at fixed order 79 


actually fall far outside the respective bands obtained from a simple scale variation 
by a factor of two applied on the leading-order result. As an example, Fig. 2.25 shows 
the NLO correction for the pı -distribution of the additional jet produced in V + j at 
the LHC. In the publications dealing with this problem, this apparently tremendous 
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Fig. 2.25 The transverse momenta of a Wt boson (left) and accompa- 
nying leading jet, jı (right) produced at the 7 TeV LHC, computed at LO 
and NLO in QCD. The leading jet is required to have p1 > 200 GeV, after 
jets are clustered with the k,-algorithm using R = 0.7. The central scale 
choice is uF = UR = lm, + pr. and the bands correspond to variation 
of this scale by a factor of two in each direction. 


K factor!® has been related to new configurations, like the ones exhibited in Fig. 2.26. 
Essentially, the idea there is that for jets at very large transverse momenta compen- 


wr 


Fig. 2.26 A typical configuration giving rise to “giant K-factors”. This 
configuration cannot be interpreted as real correction to any underlying 
W + 1 parton contribution, but rather as some electroweak correction to 
a QCD process, in this case a parton-level configuration related to dijet 
production. 


16Tn some loose sense, the term K factor introduced for total cross-sections could also be applied 
to the ratio of higher-order and lower-order results for distributions, with the effect that the K factor 
then becomes “local”. 
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sating each other, an additional relatively softer vector boson could be interpreted as 
a real electroweak correction to a process that essentially is a QCD process. One of 
the strategies for addressing this sort of issue, presented in Ref. [830], will be discussed 
further in Section 3.4.2. 


2.2.7.4 K-factors: rules of thumb 


The K-factor for a process is a useful concept, but often ill-defined, as it depends on 
the order of the PDFs (and the as(mz) value) used in the evaluation of the matrix 
elements in the numerator and denominator of the K-factor, the scale choices in those 
matrix elements, and even the chosen jet algorithm and size, for processes involving jets 
in the final state. See for example, the discussion in Ref. [320] and in Section 4.3. NLO 
corrections are most-often thought of as positive additions to the leading-order cross- 
section, but, depending on the parameters listed above, can just as well be negative. It 
is important to remember that the K-factor is a ratio of cross-sections at two orders, 
and it is most often the leading-order cross-section that is changing most rapidly, and 
the NLO cross-section that has a more stable scale dependence. 

There are some rules of thumb, however. NLO corrections tend to be large for 
processes for which there is a great deal of colour annihilation in the interaction. The 
prime example is the process previously discussed, gg —> H, in which two colour octet 
gluons collide to produce a colour-singlet Higgs boson, with a very large correction 
from LO to NLO. In addition, NLO corrections tend to decrease as more final state 
legs are added. For example, the K-factor for gg + H +jet is less than that for gg > H. 

A useful, albeit simplistic, rule is given by the equation below: 1” 


Ci ate Cig z Cf maz (2.131) 


The relative size of the NLO corrections for a given process depends on the sum of 
the Casimir colour factors for the initial state minus the Casimir colour factor for the 
biggest colour representation possible for the final state. Again, this is a rule of thumb 
and not a rigorous statement. It is also not yet clear whether this argument can be 
extended to calculations at NNLO. 


2.3 Beyond fixed order: W boson production to all orders 


In this section, the basic ideas underlying resummation techniques are discussed. After 
introducing some of the technology and terminology through a classic example in QED, 
the findings will be generalized to the case of hadronic W production. 


2.3.1 A QED example 


In the following the discussion will follow the classic example of resummation, first 
described in [796], namely the low p, distribution of lepton pairs produced in e+ e7 
collisions, where the contribution of the Z° bosons has been ignored. 


17The so-called Dixon conjecture. 
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Fig. 2.27 Diagrams for the process e et — t47 (upper panel) and for 
eet — + é~+ (lower panel) in QED. The diagrams relating to the Born 
term in the upper panel and to the photon emission off an initial line, like 
the one in the left lower panel, are relevant for the following discussion. 
Diagrams corresponding to photon emission off a final-state leg will be 
suppressed by fixing the ¢¢~ invariant mass. 


2.3.1.1 p,-spectrum in the single emission approximation 


Typical diagrams related to the emission of additional photons in the process e~ e+ — 
y* — €~¢* with y* signifiying a virtual photon are exhibited in Fig. 2.27. The emission 
of photons off the final-state leptons, like in the right diagram of Fig. 2.27, will be 
ignored for simplicity. This is well justified, as in the case of QCD discussed below, they 
will also be irrelevant, since gluons do not couple to leptons. In this approximation 
the cross-section for this process can be written as a combination of e~et > y*7¥ 
with the subsequent decay of the virtual photon to the leptons, y* — £~¢*, which is 
supplemented in a seond step. 

Then the former part can be obtained from a similar process already encountered, 
namely the emission of an additional gluon in the production of a heavy vector boson 
with mass M in the annihilation of a quark—anti-quark pair, cf. Eq. (2.97). There, of 
course, the massive boson was a W boson. Adjusting couplings and ignoring colour 
factors, therefore 

Mi sora a ee 
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where Q? = M? denotes the invariant mass of the virtual photon and ŝ, Ê, and û are 
the usual Mandelstam variables, satisfying 


s+t+a = M? = Q? (2.133) 
or 
Q?-i-a=. (2.134) 
Using 
dô |M]? 
— = 2.1 
dt 1678? (2139) 
one therefore finds 
dG e- etry = zi E+ at + 2Q78 l (2.136) 
dt 5 tt 


for the differential cross-section. In the following the limit of Ê, â — 0 or, equivalently, 
§ = Q? will be considered. Factoring out the cross-section for the production of a 
virtual photon, 


(LO Ana Ana 
a= gn =e (2.137) 
yields 
a. coe 2 2 42028 
SE O e wey (2.138) 
dt 278 ta 


This ultimately also allows the replacement of this cross-section for the production of 
a virtual photon with the cross-section for producing a lepton pair instead, 


2 
~ (LO) _ 4ra 
Teret atit T 3Q2 ? (2.139) 
to arrive at the double-differential cross-section 
dôe-et—t- e+ ~(LO a +4? +2Q73 
Se CESP eg So a l (2.140) 
dtdQ 2T8Q tû 


The limit of small transverse momenta of the virtual photon, or, equivalently, the 
lepton pair, Qı — 0, is related to t > 0, â — 0, or both ¢ and û approaching 
zero. Due to the symmetry of the cross-section in Eq. (2.140) under the exchange 
Ê © û it is sufficient to only consider the case where one of the two Mandelstam 
variables goes to zero, say Ê — 0, with the other one being less singular, but potentially 
approaching zero as well. Assuming that f is the relevant Mandelstam variable allows 
the replacements 


f+ -@2 30 and û = Q?-3-i > Q?-8. (2.141) 


To capture the potential divergence for û — 0, one must integrate over û. This is 
equivalent to integrating over Q?, which in turn exposes the logarithmic divergence 
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for & + 0 as the upper limit at the kinematic boundary of Q? ~ 8, namely 
QR <3- 07» (2.142) 


Keeping in mind that the region Q? < & is being analysed therefore results in 
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Consider now the Q1 -integrated scaled cross-section 
pr 
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with pı some arbitrary but finite cut-off on the maximally allowed transverse momen- 
tum of the lepton pair. This integral will diverge due to the collinear divergence that 
is manifest in the 1/Q* term in the real emission term GR. So, naively, one would 
now expect that this cross-section diverges. In fact, however, this is not the case; as 
discussed in Section 2.2.5 the Block—Nordsieck and Kinoshita—~Lee-Nauenberg theo- 
rems guarantee that this divergence will be cancelled exactly by another divergence 
in the corresponding virtual contribution, where again the final state fermions will be 
ignored. This results in a finite overall cross-section at order a [257, 678, 724]. In other 
words, adding in ôy x 6(Q? ) and normalizing to ĉo results in 


re -2j ag PEO 140l), (2.145) 


with the coefficient of the order-a contribution being free of any potentially large 
logarithms. Decomposing the integral as 


2 2 dôr + Gv) _ d(Gr + ôv) dôr 
ĉo l dQ. dQ? = al di a dQ? +f dQi dq? |’ (2.146) 


pi 


where the fact that the virtual contribution Gy is concentrated in the region of Q? = 0 
has been accounted for, allows one to rewrite 
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Fig. 2.28 Photon emission off a fermion line. Here, the thick blob repre- 
sents the rest of the process, while the fermion line corresponds to one of 
the incoming electrons. 
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In the approximations up to now all terms that would yield constant or single logarith- 
mic contributions have been ignored — therefore the result obtained in Eq. (2.147) is 
correct in the double leading logarithmic approximation (DLLA), also known 
as the Dokshitzer-Dyakonov—Troyan approximation (DDT), which was first 
developed in [476]. 


2.3.1.2 Multiple soft photon emissions 


In order to deal with multiple photon emissions it will prove useful to remember the 
way the eikonal expression in Eq. (2.17) has been derived. In a sketchy way, the 
relevant photon emission contribution exhibited in Fig. 2.28 can be written as 


= H p+# ae = p'e 
e €,(k) U(p)y pane = aup) k os (2.147) 


where terms have been ignored that are finite in the soft limit k — 0. This result 
emerges from using both the anti-commutator of the Dirac matrices and, subsequently, 
the equation of motion for the spinor. 

In the same way, the two photon contributions can be written as a factor 
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where the factor of 1/(2!) stems from the proper symmetrization of the two identical 
particles, the two photons, in the final state. From there, it is straightforward to see 
the form of the n-photon contribution, 
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e” pe, Pp €2 Pén 
ch ruts 2.149 
n! p ky pika pe kn ae 
The corresponding n-photon contribution to the scaled cross-section reads 
a el oi ee 5J” 
E (p) = — = Jawe log z = | = log? J ; (2.150) 
= n! | a J L Q? n! Qn p? 
taking into account the leading logarithms only. This implies that 
Zi a 3 |" a 8 
DANE 2 2 2 
(Qi) = 5 -l | = log A exp | = log J (2.151) 
n=0 


after summation over all possible number of emissions. 
From this the normalized differential cross-section with respect to the transverse 
momentum squared can be obtained to all orders in the DLLA as 


1 dô d a 1 § Q § 
= E(Q?) = l log? . 2.152 


Inspection reveals that in this expression the resummation of the double leading log- 
arithms has indeed tamed the divergence for Q3 — 0, and in fact the cross-section 
tends to zero in this limit. It can be shown that this very nice suppression in fact is 
too strong and sub-leading logarithms will ameliorate this situation. 

E(QŽ) is called the Sudakov form factor, and it encodes the probability for 
not emitting a photon with a transverse momentum larger than Q1. This probability 
tends to zero for Q1 — 0; in other words it is impossible not to emit photons with 
arbitrary small transverse momentum. 


2.3.1.3 Impact parameter space 


Up to now, the individual photon emissions have explicitly been treated as indepen- 
dent, uncorrelated processes, which is not true. After all, the transverse momentum 
Q. of the lepton pair is given by the sum of all individual emissions, 


i=- kia. (2.153) 
1=0 


This constraint can be cast in the form of a 6-function, which in turn can be expressed 
through a Fourier transform to impact parameter space. The impact parameter is 
conjugate to the transverse momentum, 


5° (a. +a) = z fe exp R . (a Sh) (2.154) 
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Similarly, the individual emission factors 


al § 
kai — 2.1 
can be Fourier transformed to yield 
1 3 5 
vidi.) = E | Prep [ib a Es] v(ky i). (2.156) 
T 


The contribution of exactly n photons that conspire to yield an overall transverse 
momentum of Q1 of the lepton pair thus reads 


1 dé™ 1 i eS n 
55 aE = ant | PH ow [EG] pe) (2.157) 
A ! 


and summing over all numbers n of photon emissions results in 


1 qô) 1 = > 
EE = [09-0] ole) 
o dQ? (2.158) 


= > feas Jo(b1 Q1) exp [ve] ; 


where Jo is the typical Bessel function stemming from angular integrals. 

A few comments are in order here. This method to include transverse momentum 
conservation seems at first a bit opaque, it is however necessary. It is crucial in a 
situation where the photons balance each other and lead to a zero net transverse 
momentum of the lepton pair. In such a situation the vanishing Q, guarantees that 
the Sudakov form factor goes to zero, indicating that at least one photon must have 
been emitted. Configurations, where only some of the photons are correlated such 
that their transverse momenta compensate each other, are of non-leading order in the 
leading logarithmic expansion for arbitrary, and possibly large, transverse momentum. 
These configurations become leading when all photons are correlated and mostly soft, 
because this is the only way to obtain the zero total transverse momentum of the 
lepton pair. In that respect the Fourier transformation to impact parameter space is 
a straightforward way to incorporate sub-leading effects due to transverse momentum 
conservation systematically. 

In the example here, small transverse momenta of the lepton pair translate to large 
impact parameters, and vice versa. When discussing the example with hadrons in the 
initial state, however, the naive Fourier transform encountered up to now will be fur- 
ther modified by form factors which take into account that the incident partons have a 
non-zero intrinsic transverse momentum, which is typically non-perturbative in origin. 
In such a picture, where multiple parton emissions are considered, the factorization 
scale will decrease from the hard scale of the process down to the low scale, where the 
perturbative treatment is assumed to break down. This scale is then identified with 
the inverse of the large corresponding conjugate impact parameter b_. 
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2.3.1.4 Adding electron structure functions 


There is yet another physics effect that needs to be taken into account. As we have 
seen in previous sections of this chapter, quantum fluctuations introduce some internal 
structure for the initial leptons, cf. the discussion in Section 2.1.1. This structure 
can be captured by the introduction of electron structure functions, here denoted by 
fejelt; u3), where the factorization scale typically is chosen to be of the order of the 
transverse momentum of the final-state lepton pair. This feature will become more 
important in the QCD case; here it suffices to say that an obvious choice for the 
structure function in b-space will be such that the factorization scale is replaced by 
1/b1. 


2.3.2 Standard resummation for pı w 
2.3.2.1 Summing soft gluons 


Applying the technology developed so far in a naive translation from the QED to the 
analogous QCD case, the pı -distribution of the lepton-neutrino pair in qq’ > veb”, 
essentially reduces to replacing the QED emission factor of Eq. (2.155) with a suitable 
QCD factor. This can be done by merely replacing the a@ with as and adding the 
corresponding quark Casimir operator CF. 

There is an important difference, however. While in QED the coupling does not 
change dramatically with the scale, the QCD coupling does. This implies that the 
choice of scale in this emission factor is important and will have numerically significant 
consequences. By merely glancing at the structure of v(k_,;), the only available scale 
within this factor is the transverse momentum, and thus 


(RG 1 
pQCD) (k) = a Cr loe (2.159) 


2.3.2.2 Leading-order leading logarithmic expression (LOLL) 


At the accuracy level achieved so far, leading logarithmic accuracy, the doubly 
differential cross-section for the production of a singlet, here a W* boson, reads 


da X d?b Shey ENRE 
AX = aAA S a [explib G Wat Q aa, a) }, (2.160) 
ij 


dydQ? 


with y and Q1 the rapidity and transverse momentum of the singlet. The factor 7 
on the right-hand side of the equation stems from the integration over the azimuthal 
angle of the produced system X. The leading-order cross-section for the production of 
the singlet X would of course be identifed with the W-production cross-section: 


2 2 
.(LO) _ (10) _ 4ra|Vi| 
ij>X = “ij>aWwWt — 


ee 2.161 
12sin? Owm?, ( ) 


Eq. (2.160) already exhibits the choice of scale at which the PDFs are evaluated, 
namely 1/b,, as seen in the equation below for W;,, 
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Wij (0; Q, ea, £B) = 


2 


Q 
1 1 dk? Q? 
fija (2a. =) fi/B (xo, =) exp |— I a (A044) toe S) 


(2.162) 


The sum runs over all relevant parton flavours 7 and j contributing to the production 

of X. The energy fractions with respect to the incoming hadrons A and B, z4 and 
zp, are fixed by the rapidity of the singlet system, 

Mx + 

x = =e, 2.163 

AB = 7 (2.163) 


which is also the solution for the zeroes of the residual 6-function in Eq. (2.107). 

Comparing terms with previous equations also shows that the exponential in the 
resummation part Wi; is nothing but the Sudakov form factor in kı space. Thus, 
the term A(k?), at the logarithmic order considered up to now, i.e. double leading 
logarithms or DDT approximation,!® and ignoring the potentially dangerous region of 
large b1, is given by 


as(hi) 


T 


Athi y= CFf (2.165) 


2.3.2.3 Improving the accuracy: the master formula 


In order to increase the accuracy of Eq. (2.160), higher-order terms must be added, 
extending the equation by various bits which have hitherto not been included. In 
particular, this includes emission terms beyond leading logarithms which can be ex- 
ponentiated as well and thus enter the Sudakov form factor. This will translate into 
supplementing the term Alog(Q?/k?) with another, non-logarithmic term B in the 
exponential of Eq. (2.162). In addition, higher-order terms H and C stemming from 
finite virtual and collinear parts, which come with the same underlying Born kinemat- 
ics, must be included; they will multiply the resummation bit Wij, which is therefore 
modified. Furthermore, exact real emission matrix elements can be added in such a way 
that the terms which describe the emissions in the soft leading-logarithmic approx- 
imation through the Sudakov form factor are not double-counted. These additional 
hard emission terms will not be resummed and therefore enter as some finite remain- 
der Y;;.x, effectively the difference of the exact real emission matrix element and 
the correspondimng O (a,) expansion of the emission pattern encoded in Wij. With 


18In fact, in the original DDT result, the Sudakov form factor was given in a form also including 
some universal sub-leading logarithms (the term 3/2): 


2 
ky 


| Q? 2 a (k2 2 ] 
a f u wp Cr (we -3)): (2.164) 


a 
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this in mind, the master equation for Q, resummation in the CSS formalism 
discussed up to now reads 


do % d?b EAO E 
ARAE Ds RO Sy {| nye [epot - Q1) Wi; (b; Q, £A, va) 
ij 


dydQ? 
(2.166) 


+Yij>x(Q1; Q, TA, va) ; 


where Wj; and Yi;,x will be given in Eqs. (2.167) and (2.170), respectively. 
For the resummation bit, increasing the precision amounts to extending the func- 
tion W;; such that 


1 


Wi;(b; Q, 2a, £B) = 5 jeje = tar (sa +) fo/B (e +) 
ab LA 


TA B TA TB 
X Cia —, b1; Cj z, bl; Ha , ; 
: 7 r) (2 n) (eB En n) 


2 
x exp | — J at (AGH) 08 S + B) 


(2.167) 


All terms A, B, C, and H can be expanded in a perturbative series, which will define 
the formal accuracy of the result in terms of the logarithmic order (LL, NLL, ...) 
and the fixed-order (LO, NLO, ...) accuracy. Note that the origin of the terms A and 
B can also be traced to the splitting functions, see also Sections 2.3.3 and 5.2. This 
expansion yields, for example, 


A(n) = 5° Eon AW) (2.168) 


and similar for the other terms. For instance, by direct comparison with the previous 
result A“) = 2Cp. In contrast to A“, however, the result for B“ depends on the 
lower limit of the k? integration. Choosing (2e~7” /b,)? = (bo/b1)?, leads to BO) = 
~3Cr/2, while in [410] it was given by B® = 2Cp log 2" 

The expansion of the C and H terms starts at N = 0, wath 


CL (2) = 6iad(1 — 2) 


(0) (2.169) 
Hip (ZA, ZB; H) = 9ia5;05(1 — z4)ô(1 — zB), 

which allows one to recover Eq. (2.160). 
It is worth stressing that, to the first few orders, the terms A and B depend on 
the incoming flavours only. For the case of W production these are quarks, which is 


90 Hard Scattering Formalism 


reflected in the corresponding colour factor, Cr. For other processes, such as, e.g., the 
production of a Higgs boson in gluon fusion, gg —> H, these terms would be propor- 
tional to C4, a factor of two larger, apart from sub-leading colour corrections. This 
reflects of course the fact that, trivially, a gluon has two colour degrees of freedom, 
while a quark has only one such degree of freedom. A more detailed discussion of 
the technical steps summarized here, and an application of this formalism to a vari- 
ety of processes, will be presented in Chapter 5. There, other different resummation 
techniques will also be briefly discussed. 
The finite remainders Y can be expanded in a similar series, 


ae Q, 4; £B) = 


ie Zer fe (£B: n) >, ($ Rox (e aa’ all 


with the first non-trivial terms RY say different from zero listed in relevant sections in 
Chapter 5. 

In principle, scales in the hard remainder could be chosen differently from the 
choices made in the resummation term; this has been made explicit by introducing 
separate factorization and renormalization scales wr and upr. In addition, the hard 
scale Q can be identified through Q = Mx = my and the Mandelstam variables 


which will emerge in the functions R;j—x can be expressed through the other kinematic 


parameters as 
1421 
1 ; vito 
ê= — QÈ and fû =| po Z] e. (2.171) 


EAEB : EBA 


The connection of resummed and fixed-order cross-sections is exemplified for the 
case of W* production at an 8 TeV LHC in Fig. 2.29. As the transverse momentum of 
the Wt boson approaches zero, the resummation bit, the exponential term in Wi; takes 
over and guarantees that the triple differential cross-section goes to zero, although the 
fixed (first) order result diverges. On the other hand, the finite remainder terms, the 
differences between resummed and fixed-order cross-section are visible only in the hard 
region, where Q1 —> Q = my. Note that the soft region, where b + oo, has been 
regularized with the Collins-Soper scheme, which will be discussed in more detail in 
Chapter 5. There, in addition, more results for this and other processes will be given 
and the anatomy of different contributions will be discussed in more detail. 


2.3.2.4 Dealing with the soft region 


This looks very good, but it has an additional problem, not present in QED. Integrating 
over all values of kų down to zero results in the need to evaluate the strong coupling 
at scales around and below the Landau pole, where it diverges. 
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Fig. 2.29 The triply differential production cross-section for a Wt boson 
at an 8 TeV LHC, from [300, 716]. As customary, the cross-section is evalu- 
ated for Q? = mi, and at central rapidity y = 0. The soft region has been 
regularized according to the Collins-Soper scheme. 


This is typically cured by modifying the Sudakov form factor in impact parameter 
space. A naive way of achieving this is by adding a soft form factor through a function 
p, multiplying the Sudakov form factor for all values of bı . A typical parameterization 
of this function p would look like 


p(b.) = exp (-4) , (2.172) 


where A usually is identified with the average intrinsic transverse momentum of the 
incident partons due to Fermi motion inside the nucleon or similar effects, 


A= eee ae x (2.173) 


The effect of such a modification is negligible for small b] , but effectively amounts to 
a dampening for large b} or small k]: 


E(@CD)(p, ) ae CCD) (5, ) pbi). (2.174) 


Of course, there are more methods to tame the divergent behaviour of QCD around 
the Landau pole, which will not be further discussed here. 


2.3.3 Aside: resumming jet rates in ete~ — hadrons 
2.3.3.1 Origin of the resummation terms A and B 


The formalism developed for the case of the p, spectrum of the W boson in the 
previous section can also be applied to a different case, namely the formation of jets in 
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the annihilation of electron—positron pairs into hadrons. This additional example will 
further highlight the versatility of the resummation approach, and it will also motivate 
why parton shower Monte Carlo event generators do a fairly decent job in describing 
the bulk of existing data. 

As a first step the Sudakov form factor, encoding the resummation of multiple 
emissions, needs to be re-examined. This will lead to an alternative explanation for 
the specfic form of the terms A“ and B®. To see how this works, consider gluon 
emission off a quark in the approximation, where the gluon is collinear with the quark. 
Then, the respective matrix element is well approximated by the splitting function, 
Paq(z)- This splitting function would merge when keeping also terms of order k” /(p- k) 
in Eq. (2.147). Correspondingly the emission factor v(QCP) in Eq. (2.159) becomes 


FOP) hihi PRG tee 25 aiae E E (2.175) 
T ky ka aag i T ky me , 
with ‘ 
l+z 
Pul) = 7: (2.176) 


The integration over k, must also be supplemented with one over z. This integration 
needs furthere manipulation, because of the divergent structures in splitting functions 
for z + 1 or z > 0, which must be regularized. While this is typically achieved through 
the “+”-prescription, cf. Eq. (2.33), another way to guarantee finite integrals will be 
pursued here. 

In the application of the resummation formalism to the description of jet produc- 
tion, the emitted partons, the gluons, must be resolved, for example by demanding 
that they have a minimal transverse momentum Qo with respect to the emitting quark, 
kı > Qo. Momentum conservation ensures that the momentum fraction they carry 
away from the quark must be non-zero, and the residual momentum fraction of the 
quark therefore must be smaller than 1 by an amount of € = k? /Q?, if Q is the 
scale of the quark momentum. This allows to drop the “+”-prescription in the split- 
ting function, and, correspondingly, the 6-function compensating for it. This form of 
modification, from now on, will be made obvious by replacing Pi; — Pij. 

Thus the Sudakov form factor can be rewritten for this specific case as 


l-e 
dk? | as(k? 
S(Q, Qo) = exp Jh (ea) ip dz Pig) (2.177) 
oa "9 


Inspection of the z-integral reveals that it can be approximated by 


=g le l-e 1 

1 2 2 
[Pal = Cr pu n x Cr fey - feara 

-z -z 
0 0 0 0 (2.178) 
2 
= 2Cp og Q - A = TAQ? k2). 
ko 4 
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Here, for the sake of the following discussion, the integrated splitting function [ 
has been introduced. From these results, the coefficients A“) = Cp and B® = —3Cpr 
are readily read off. Of course, they are identical to the coefficients in the standard 
resummation procedure outlined in Section 2.3.2. 


2.3.3.2 Sudakov form factor and its interpretation 


This allows one to write the Sudakov form factor in this case as 


2 


5(Q?, Q2) = A(Q?, Q2) = exp |- i dki 
Q? 


a T(Q?, k?) (2.179) 


How can this be interpreted? In Section 5.3 it will be argued that the Sudakov 
form factor is nothing but a probability for a given particle not to radiate a 
secondary particle between the two scales Q? and Q?. This can be motivated by 
realizing that 


(a) its kernel is related to the emission of a particle; 
(b) its form as an exponential of a negative definite argument guarantees that 


S(Q?, Q2) € [0,1] (2.180) 


as expected from a properly defined probability; 

(c) in the limits Q? >> Q? it approaches 0, which is what one would expect: it becomes 
increasingly unlikely that particles do not radiate in an increasing interval of 
scales. 


This reasoning will be further extended at the beginning of Section 5.3, by elaborating 
on a correspondence with the decay of a radioactive isotope. 


2.3.3.3 Sudakov form factor for fixed and running a, 


In general, the integrated splitting kernels are given by the usual coefficents 


Q? 
Pa’, P) = AY log 7z + B pa (2.181) 


( 
9 4:9 
Note that in the case of gluons the corresponding integrated splitting function consists 
of two parts, the g + gg and g —> qq splittings. This leads to Sudakov form factors 


given by 


2.89 Os a) 2 Q? (1) Q? 
Agg(Q", 1) = exp|—5— | Aga los gt By) log “y (2.182) 


for a fixed ag or 


s (Q* s(Q*) 
Ay (Q, P) = exp Ee (a log? a + BO) log Par )| (2.183) 
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for ag running at one-loop. 


2.3.3.4 Emissions and jet rates 


The two-jet rate Ro in e~e+ — hadrons is just given by the combined probability 
that neither the quark nor the anti-quark in the final state of the underlying Born- 
level diagram radiate a gluon above a jet resolution scale given by pP > Qeut- The 
probabilistic interpretation of the Sudakov form factor yields 


Ra(Qeut) = [Ag(Q?, Q2] - (2.184) 


In order to obtain the three-jet rate, it is important to realize that the probability 
density dPraa(q7 )/dg? to have a radiation off a quark at q1 is given by 


dPraa(q7,) = dA, (Q?, OF) 
dq? dq? 
_ alg) 1 2 2 2 n2 Di 230 2 72 
= = Cr g Ta(Q , qi) Aq(Q , cut) O(Q q4 )O(q1 sa: 
Ti 


(2.185) 


Here, the O-functions guarantee that q? € [Q?,,, Q?]. 


cut? 
This allows one to write the three-jet rate as 


Q? 2 2 

AG = 2A4(Q%, ae) _f | (erie È) 
TL T 

Qat 


A,(Q?, 2a) 
Ay(Q?, È) 


x Alg? , cut )Ag (q7, 3 , 


(2.186) 


where the term in the round brackets accounts for the emission of the gluon off one of 
the two quark lines, and the two additional Sudakov form factors ensure that the two 
offsprings of this splitting do not radiate further. The ratio of Sudakov form factors 
can be interpreted as the probability for the intermediate quark line not to experience 
any radiation resolvable above Qeut, between Q? and qf. 

Similar expressions can also be constructed for higher jet multiplicities. A conve- 
nient way to do this is by the introduction of generating functionals, cf. [345] for more 
details. 


2.3.3.5 Scaling patterns in jet production 


Generating functional technqiues can also be used to study the scaling behaviour of 
exclusive jet multiplicities, i.e., the cross-sections o, for the production of exactly n 
jets, possibly in association with other particles. There are two extreme scenarios or 
scaling patterns, namely 


1. staircase scaling, which is characterized by the ratios 
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on Rn ; On 
Rin+1)/n = a = R or Rin+1)/n = i = R with R, = eat 
(2.187) 
being constant, and where R depends on the core process and the requirements 
on the jets only, but not on the multplicity of jets. This pattern is also known as 
Berends scaling [223], and it has been observed for example by UA2 [140], at 
the TEVATRON [107], and the LHC [20, 364]. 


2. Poisson scaling, where the exclusive jet rates Py behave like 


n” =n ay 


n 
Rn = —— or Rin+1)/n = 


< (2.188) 


n+1` 
Such patterns emerge when individual events, in the case at hand jet formation 
through hard parton radiation, repeat themselves and are independent from each 
other. 
In order to see how this pattern emerges, consider the subsequent emission of two 
gluons off a primary quark line, given by 


2 


Q? 
Gad X ; faia t)A,(Q?, t) Pirog. PAHO 2) 
Q Q% 

(2.189) 
where Qo represents the jet resolution scale and Q is the hard scale of the process, 
and the factor of 1/2 accounts for the ordering of emissions. The Sudakov form factors 
guarantee that the gluons do not experience any further splitting, and they form jets. 
Similarly, one could have a second contribution, where the second gluon emerges from 
a secondary gluon splitting, 


2 t 


oL) agg X / dtr gq (Q, AO? t) f dt Tggglt, t)Ag(t, t) |- (2.190) 
Q3 Q3 
There are two important limits to consider, namely 


1. % log? on > 1. Expanding the results for 0) and o) around Qo/Q > 0, the 
leading contributions are given by 


2 
IP) hg X : = log? E V4a,C% log = +0 (3) 
Q 


2 
OP) hy X I K — 1) Va,C4 log a, +? (3)| ; (2.191) 
0 
indicating that the pattern of subsequent, primary emissions off the quarks is 
enhanced with respect to the secondary gluon splittings. This is the limit of in- 
dependent emissions, therefore exhibiting Poisson scaling. 
2. S log” & < 1. In this limit 
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as 4 6 Q 2 
aP agg X =<— log* — +O (e: log 2) x ol? (2.192) 
0 0 


q—>qgg 4(27)? Q q—>qgg ? 


and primary and secondary emissions off the quark or gluon, respectively, con- 
tribute in roughly equal, democratic measure, with relatively small emission prob- 
ability. This modifies the Poisson scaling. In QED such a non-Abelian contribution 
from secondary emissions is absent, and as a consequence Poisson scaling is not 
modified in QED. 


At this point it should be clear that in order to observe Poisson scaling, the individual 
emissions must be decoupled, to be treated as independent. This can be enforced by 
selecting events with a strong hierarchy, for example by demanding that the hardest 
jet has very large transverse momentum, while all other jets can be much softer. In 
such configurations, the large scale ratio of the hard jet to the softer ones guarantees 
large logarithms. Increasing the number of jets leads to more scale ratios with vanish- 
ing logarithms, and this also introduces a jet emission phase space that is increasingly 
constrained through momentum conservation. As an overall effect, the absence of log- 
arithmic enhancement means that the emissions cannot be treated as independent 
any more, negating the condition for Poisson scaling, and staircase scaling patterns 
emerge. 


2.4 Summary 


In this chapter the technology underlying every discussion of high-energy phenomena 
at hadron colliders based on first principles, perturbative methods, has been intro- 
duced. In order to calculate cross-sections, the idea of factorization must be invoked. 
This idea is at the heart of perturbative QCD at hadron colliders. First of all, factor- 
ization guarantees that the partons, the constituents of the protons, can be treated as 
quasi-free particles. This is the case if the characteristic time-scales related to the pro- 
cess probing them are sufficiently smaller than the typical response time of the strong 
field. Only then, when the partons can be treated as quasi-free, can they be quantized 
like any other field in quantum field theory. This allows a perturbative expansion, 
typically represented by Feynman diagrams, which is a systematic treatment based on 
the Lagrangian of QCD. At the same time, the validity of factorization allows a de- 
termination of the parton distribution functions (PDFs) in a process-independent way 
which can be used to evaluate cross-sections for all other processes. At leading order 
the PDF fa/n(x, pr) describes the probability to find a parton a in hadron h with a 
momentum fraction x at the factorization scale up. Since they can be related to the 
bound state stucture of the respective hadron they fulfil a number of sum rules, which 
in turn act as theoretical constraints. While the PDFs are non-perturbative objects, 
their evolution with the factorization scale is governed by the perturbative DGLAP 
equations. This evolution in turn can be employed to understand multiple softer emis- 
sions. In particular, initial-state radiation can be interpreted as the breakdown of the 
coherence of the multi-particle Fock state of the incident particles, where the quantum 
fluctuations populating the Fock state are governed by the evolution equation. 

At the same time, final-state radiation leads to a proliferation of particles in the 
final state by repeated emissions of secondary quanta. Again, the behaviour of the ra- 
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diation pattern is governed by an evolution equation. Due to the presence of massless 
quanta, such as photons or gluons, however, some care is necessary to obtain phys- 
ically meaningful results when analysing this additional radiation. This is typically 
reflected by certain phase-space conditions which lead to subsuming emitted quanta 
into observable particles. In the case of QED this effectively leads to adding sufficiently 
soft and/or collinear photons to the leptons emitting them; in contrast, in QCD the 
picture is further blurred by the fact that the quanta of the strong interaction, quarks 
and gluons, only occur in bound states, the hadrons. In order to arrive at meaningfully 
defined physical objects, the combination procedures must therefore be valid on the 
parton and the hadron level. They must not destroy the perturbative precision in the 
desciption of how these multi-particle states emerged from radiation in the first place. 
This then leads to the technical definition of jets through suitable jet algorithms. 

The development of the perturbative formalism has been exemplified by W pro- 
duction. This process has been one of the “standard candles” in the experimental 
programme at the TEVATRON and the first run of the LHC and it will continue to play 
a similarly central role also for the physics programme in Run II and beyond. 

Already at leading order, a rapidity asymmetry in the production of the two charge 
eigenstates W= of the W bosons emerges, which translates into a more convoluted mea- 
surable asymmetry of the charged leptons coming from the W decay. This observable 
is being used to further constrain the PDFs. 

Of course, going to higher perturbative orders increases the precision of the the- 
oretical results, but this enhanced quality of the prediction comes with a price. First 
of all, the corresponding matrix elements tend to diverge, necessitating regularization 
and renormalization of the ultraviolet divergences present in the virtual contributions, 
and requiring the treatment of infrared divergences occuring in the real and virtual 
contributions. Since the latter cancel exactly for all physically sensible observables, it 
is mandatory to regularize them in each contribution. Due to the different dimensions 
of the respective phase spaces and the different sources of the divergent structures, 
loop vs. phase space integrals, this presents a considerable nuisance in higher order 
calculations. To highlight this point, note that this problem, in fact, has been fully 
and robustly solved for next-to-leading order (NLO) calculations only. It should also 
be noted that only the inclusion of higher-order corrections allows the quantification 
of theoretical uncertainties in a meaningful way, by varying the renormalization and 
factorization scales appearing in the calculation because of the truncation of the per- 
turbative series. While in principle this seems to be a fairly straightforward exercise, 
there is some subtlety involved, especially when choosing a meaningful central scale. 

At the same time, the NLO calculations contain a contribution, the real emission 
part, that can interpreted as a new process. Instead of the inclusive production of a 
system, like the W boson discussed here, they describe the production of the system 
plus an additional parton, which can be interpreted as an additional jet. This indeed 
signals the start of a tower of ever increasing jet multiplicities described by corre- 
sponding tree-level Feynman diagrams, which contain more and more partons in the 
final state. With respect to higher-order calculations they form, on the other hand, 
towers of multiple real emission corrections. For the measurement of the W mass, the 
knowledge of the pı spectrum of the produced W bosons is indispensable, and the 
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process with one extra parton together with the W in the final state merely yields 
the leading-order expression for this observable. In order to increase the precision one 
could, again, invoke higher-order corrections. However the validity of a normal NLO 
calculation would be limited to non-negligible values of the W transverse momentum. 
This is because, already at leading order, the differential cross-section with respect 
to the transverse momentum of the W diverges for small transverse momenta. This 
divergence appears again at any fixed order of perturbation theory. However, a careful 
analysis of the situation indicates that a finite result can be obtained for small values 
of the W transverse momentum by resumming the soft and/or collinear limits of extra 
parton emission to all orders. 


3 
QCD at Fixed Order: Technology 


In this chapter, the perturbative description at fixed order of processes at the LHC 
and other hadron collider experiments, outlined in Chapter 2, will be further discussed. 
In particular, theoretical issues related to calculations for more complex final states 
will be addressed. First, the terminology used in this book to denote the accuracy of 
a given calculation is described in Section 3.1. 

Section 3.2 is then devoted to a discussion of the technology used in fixed-order 
calculations, in particular for the calculation of multi-particle final states at leading 
order (LO). 

This is followed by a discussion of techniques employed in next-to-leading-order 
calculations in Section 3.3, containing a presentation of subtraction methods for the 
treatment of the infrared divergences which facilate their mutual cancellation between 
virtual and real corrections and advanced methods to evaluate the former in processes 
for multi-particle final states. 

In the final part of this chapter, Section 3.4, some ideas are presented on how even 
higher orders in perturbation theory can be treated. 


3.1 Orders in perturbation theory 


To discuss in more detail technicalities and results of fixed-order calculations in QCD, 
it is mandatory to establish and define a suitable language first. While most peo- 
ple appear to have a clear idea what they mean when talking about calculations at 
“leading order” (LO), at “next-to-leading order” (NLO) or even at “next-to-next-to- 
leading order” (NNLO), these ideas do not always coincide. In the framework of this 
book, these notions will be used as follows: the order, at which an observable is cal- 
culated, is denoted as “leading order”, if the result of the calculation yields the first 
non-trivial contribution to the perturbative expansion to this very observable. Con- 
sequently, “next-to-leading order” denotes the order at which the first perturbative 
correction to the same observable is being evaluated. 

To see how this works in more detail, consider the by-now notorious example of W 
production, invoked in the previous chapter. Calculating the process, or to be more 
precise its cross-section, at leading order introduces the Feynman diagram in the left 
panel of Fig. 3.1. Due to the dependence on the value of x probed in the PDFs, this 
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Fig. 3.1 Feynman diagrams for W production at hadron colliders. The 
left panel is related to the calculation of the cross-section, the rapidity 
distribution of the W boson, or the transverse momentum and rapidity 
distributions of the individual decay products at leading order. In order 
to have a leading-order expression for the boson’s transverse momentum, 
real-emission diagrams such as the one in the middle panel must be con- 
sidered. Together with the virtual corrections. such as the loop diagram in 
the right panel, they also form part of the next-to-leading order correction 
to the total cross-section, etc. 


Table 3.1 The order at which various observables related to W production 
are computed, as a function of the overall power of the strong coupling in 
the theoretical calculation. 


strong coupling order | otot, do/dy | do/dpr(W) | da/d¢,;; 
a’ LO - - 
al NLO LO - 
a2 NNLO NLO LO 
ad NLO NNLO NLO 


diagram also predicts a non-trivial rapidity distribution. On the other hand, since 
the partons are collinear with the incoming protons, this diagram will not produce 
any transverse momentum distribution of the W boson. Instead it will generate such 
distributions only for its decay products. In order to have a leading-order expression 
for the p, distribution of the W boson, real-emission diagrams such as the one in the 
middle panel must be evaluated. There the boson can recoil against the additional 
parton. As seen in the previous chapter, for p” — 0 the expression related to this 
diagram diverges, a problem that actually persists for each fixed-order calculation and 
which can only be resolved through the use of resummation. However, this diagram 
not only yields a leading-order result for p” but it is also part of the nezt-to-leading- 
order calculation of the total cross-section or of observables such as the boson rapidity 
or the kinematical distributions of the W boson’s decay products. This situation is 
summarized in Table 3.1. 
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3.2 Technology of leading-order calculations 


Going back to the master formula for the evaluation of cross-sections for the production 
of an n body final state in hadronic collisions at leading order, Eq. (2.52), 


gO) = (Bom) = [ee B(s) 


II 


1 
D | dzada fajn ta, me)foynsler,ne) f AGO (ue, un) 
a,b 


0 


II 


1 
1 
S | deaden fay fajn (ase) fora (0r pe) F; Maron]? (Onis un )(31) 
a,b 6 


it is fairly obvious that already for this simplest result two major obstacles must be 
faced. 

First of all, the squared matrix element |M,p-,n|? must be evaluated. Since the 
number of diagrams increases very quickly, typically faster than factorial, the tradi- 
tional methods encountered so far become impractical. Squaring the amplitudes, using 
the completeness relations to arrive at traces which may be evaluated analytically is 
just too complex an operation. As a consequence, in modern techniques the focus is 
on evaluating individual amplitudes as functions of their internal and external de- 
grees of freedom, which means that every amplitude becomes just a complex number 
and in turn renders their summation and squaring a straightforward exercise. Modern 
algorithms to achieve this are introduced in Section 3.2.1. 

Second, the complicated structure of the phase space resulting in an integral in 
(3n — 2)-dimensions, possibly supplemented with complicated cuts, renders this high- 
dimensional integration impossible to be evaluated analytically, even if the PDFs were 
tractable in this way. As this is not the case, mainly numerical methods must take 
over, both in the evaluation of the matrix elements and in the phase-space integration. 
For the latter, essentially only Monte Carlo integration techniques are viable, since 
for them the error estimate scales like 1/ VN for the number N of integrand evalua- 
tions, independent of the number of dimensions.' A general discussion of Monte Carlo 
techniques can be found in Section 3.2.2. 

Sampling in such a way over the phase-space degrees of freedom to conveniently 
obtain a numerical estimate for the cross-section — the actual result — begs the 
question, how far sampling can also be extended to other degrees of freedom such as 
particle spins, colours, or similar. In other words, for the matrix element evaluation, 
a choice between summation and sampling over quantum degrees of freedom must 
be made. Since the computation times for summation and sampling naively differ by 
the st? power of the number of possible states, usually ~ 2° for the possible helicity 
assignments and + 3° ...8* for the possible colour assignments, dependent on whether 


lFor traditional integration methods such as trapezoid quadratures or similar, the number of 
dimensions enters such that usually the error scales like N-*/” where n is the number of integral 
dimensions and k > 0 depends on the method of choice. 
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the particles in question are quarks or gluons, this may have a significant impact on 
the overall evaluation time, especially if there are strong correlations also with the 
phase space, which would thus favour a simultaneous sampling over external (phase 
space) and internal (spins, colours, etc.) degrees of freedom. By suitably eliminating 
common sub-expressions in the matrix elements, however, these naive factors can often 
be reduced quite considerably, which renders this choice strongly dependent on the 
process and the particle multiplicity in question. 


3.2.1 Evaluating matrix elements at leading order 
3.2.1.1 Helicity amplitudes from Feynman diagrams 


In this section, modern methods for the fast numerical calculation of cross-sections 
like the one in Eq. (3.1) will be introduced, focusing on leading-order or tree-level 
evaluations. One of the first apparent problems in evaluating scattering amplitudes, 
already at leading order, is due to summing over external polarizations through com- 
pleteness relations, resulting in the need to evaluate about N?/2 terms for a process 
with N Feynman diagrams. If, instead, this summation is not being performed, every 
single one of the N individual contributions would just be a complex number, depen- 
dent solely on external momenta, colours, and spins. Naively, for n massless external 
particles with two helicity states each this would thus lead to 2” complex numbers 
to be evaluated. Considering this behaviour, the actual gain in computational effort 
through the latter method is based on the realization that the number of Feynman 
diagrams, N usually scales faster than factorially with n and therefore the number of 
contributions to be evaluated when using the traditional method becomes more like 
N?/2 < (n!)?, an exploding number of terms. 

This reasoning is the basic idea behind a set of methods which could collectively 
be denoted as the method of helicity amplitudes. In this method, any Feynman am- 
plitude (represented by propagators and vertices for the internal lines and spinors and 
polarization vectors for the external particles) is translated into a complex number, 
dependent on external helicities and momenta. This method will ultimately rely on 
smart ways to represent the building blocks of the amplitudes to allow for a reasonably 
fast and efficient numerical evaluation. However, before even entering a quick discus- 
sion of how this could be achieved there are some tricks to accelerate the computation 
time. 

First, the amplitude may vanish for certain helicity combinations. For instance, for 
massless fermions, a helicity-flip in the interaction with a spin-1 particle is disallowed 
due to the conservation of angular momentum,” translating into the conservation 
of helicity along a fermion line and thus the vanishing of half the helicity combinations 
in the full process. 

As a second trick, it is of course possible to recycle certain sub-amplitudes. As an 
example consider the two Feynman diagrams contributing to the process ug > dgt ve 
displayed in Fig. 3.2. The parts in the boxes are identical and thus only need to be 


? As an analogy, one may think about allowed and suppressed electromagnetic transitions of excited 
atoms, which are identified with E and B transitions. In the Gordon representation, the latter 
scale with a term proportional to the particle mass. 
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Fig. 3.2 Two Feynman diagrams contributing to the process ug —> dgl” ve 
at leading order. The parts of the diagrams in the coloured blobs are iden- 
tical; when constructing the helicity amplitudes, they can be factored out 
and recycled to speed up the evaluation time for the total amplitude. 


evaluated once for both diagrams depicted and for similar ones. Using such recycling 
recursively will reduce the number of complex multiplications like the ones in the 
spinor products and thus increase the efficiency of the calculation, especially when such 
common sub-amplitudes are identified in the construction of the helicity amplitudes 
and factored out from the beginning. 

In order to identify and isolate such sub-amplitudes, it is important to realize that 
tensor structures in Dirac space in the numerators of propagators can be re-expressed 
through spinors as follows: 


ptm= ; 5 (+ =| u(p, A)ti(p, A) + (: - 5) v(p, A)U(p, »] - (3.2) 


à 
A similar relation also holds true for the propagator numerators for vector particles, 
with the added complication of gauge choices for gauge bosons. Typically to fix the 
gauge, a light-like gauge vector q” is introduced, essentially fixing the axis of an axial 
gauge and resulting in 


quPv + WP * 
Gu + = = E = S elp, A)& (p, A). (3.3) 


As further discussed in Appendix A.2, this allows us to rewrite the polarization vectors 
as spinor products such that the full amplitude can be recast in the form of spinor 
products. 


3.2.1.2 Dealing with spinors and polarizations 


Of course, the reasoning above necessitates the representation of the spinors and polar- 
ization vectors of the external states in a suitable way. While this is perfectly possible, 
it will most likely also be very cumbersome if treated in such a blunt way due to 
the occurrence of massive matrix multiplications, stemming for instance from the no- 
torious Dirac y matrices. A better way is to try to decompose the amplitude into 
scalar products of spinors and of Lorentz structures, where the latter ones are fairly 
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straightforward to implement. There are different ways to represent spinors and their 
products, as discussed in more detail in Appendix A.2. 

One frequently used spinor representation is based on two-component spinors, 
also known as Weyl spinors, which indeed form a basic representation of the Lorentz 
group. To see, in short, how this works, it should be realized that the Lorentz group 
is generated by the usual six infinitesimal boost and rotation transformations along 
and around the z, y, and z axes, respectively. While individually the commutators of 
these generators form a somewhat nonintuitive algebra, they can be rearranged into a 
set of six linearly independent linear combinations which decompose into two SU (2) 
algebras. In other words, the Lorentz group is a (locally isomorphic) product of two 
independent SU(2) groups. These groups, in turn have their first non-trivial represen- 
tation through two-component objects, or Weyl spinors, associated with left-handed 
and right-handed spinors. The components of these two kinds of Weyl spinors are 
indicated by dotted and undotted indices a and 4d, each of which ranges only from 
1 to 2, a, a E {1, 2}. 

For instance, for a massless four-vector k, 


gial j 


where the light-cone momenta kt = k? + k? and ġx is the angle of the transverse 
momentum ki. By using the complex plane, k, = kı +ik° and dg = arg(k1ı). There 
is a fair amount of freedom in this definition of the spinors, like, for instance, the 
definition of the axis defining the light-cone momentum directions or the orientation 
used in the definition of the angle ¢;,. This freedom can then be used for checks of the 
actual calculation, which must be independent of these choices. 

However, with these definitions, the scalar products of such spinors ¢4(k) = |k} 
and Ça(k) = |k] can be written as 


na(k)C*(q) = nalk) Gla) = (ka) 
nalk) (a) = [kq] = (kg), 


where the latter relation holds true because trivially one could identify n_ = ņnă and 
where the fact that the individual spinor components must be Grassmann numbers, 
anti-commuting, is encapsulated in the “spinor” metric e% given by 


a ah 01 
gH’ Sete er (3.6) 


Note that it has been customary to use the momentum label of the spinors in the two 
scalar products and to identify whether it is over the dotted or undotted spinors only 
by the shape of the bracket, a convenient short-hand notation. Using them, regular 
massive Dirac fermion spinors known from the usual textbook methods can be written 
as two such Weyl spinors, arranged in a bi-spinor; this, however, introduces a “gauge”- 
like degree of freedom into their definition. 


(3.4) 


(3.5) 
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Another important building block of scattering amplitudes is the polarization vec- 
tor for external or internal gauge bosons. In general, four-vectors k” can also be rep- 
resented by Weyl spinors, by realizing that 


k” = ofp C’ (K)C (k), (3.7) 


where g? = 1 is the two-dimensional unit matrix and the of (i € {1, 2, 3}) are 
the Pauli matrices. By introducing a light-like gauge vector q, polarization vectors for 
external massles particles with four-momentum p” can be written as 


1 (qFlo"|pF) 
ei (p,q) = +5 (3.8) 
j v2 (q*p*) 
For polarization vectors for massive external gauge bosons, cf. Appendix A.2. 

It is also interesting to note that regular scalar products of four-vectors can be 
rewritten in the form of spinor products, for instance 


2k -q = (kq) [kq] . (3.9) 


0 


It is relations such as the one above which render this spinor representation a versatile 
and powerful tool to rewrite Feynman amplitudes in a form that lends itself to the 
automation of scattering amplitude calculations through numerical methods. For more 
details, cf. Appendix A.2. 


3.2.1.3 Dealing with colour 


An interesting problem when constructing the amplitudes presents itself with the inclu- 
sion of colour. Quite often, relevant theoretical result are formulated in the large-N, 
limit, and sub-leading terms usually suppressed by 1/N? or similar are omitted. It 
is worth remembering that quarks, being in the fundamental representation of 
SU(N.) carry colour indices i € [1, Ne], while gluons, residing in the adjoint rep- 
resentation, carry an adjoint index a € [1, (N2 —1)]. The fundamental interaction 
between both is mediated by terms proportional to the generators T4 of the fun- 
damental representation of SU(N), normalized through 

T TF, = 5”. (3.10) 
The self-coupling of gluons, mediated by the generators f% in the group’s adjoint 
representation, can be rewritten through the Ti as 


if Ty = TE Ps. (3.11) 


Ultimately, the colour algebra allows the decomposition of every n-gluon amplitude, 
which includes the most complicated colour structures, according to 


AM Doar eS Soo eTA ESO A Ly Garies On) (3.12) 


oESn-1 
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where o denotes all (n—1)! permutations of the indices 2...n, and the colour-stripped 
or colour-ordered amplitude is denoted by A(1, 02, ..., On), which only depends on 
the momenta and helicities of the external particles. Of course, similar decompositions 
also emerge when some of the particles are quarks or even colour-neutral objects, 
which typically leads to a simplification of the overall result in terms of colour. As 
a welcome by-product, the colour-ordered amplitudes A(1, a2, ..., Gn) correspond to 
planar graphs only, when it comes to their QCD parts, which is due only a small 
set of Feynman amplitudes contributing to each of them and which renders them 
simpler to calculate. Some of their most remarkable properties are defined by the 
Kleiss—Kuijf relations [681] yielding linear relations among such amplitudes. One of 
the consequences of these relations is that the maximal number of colour-independent 
amplitudes is (n — 2)!, showing that colour ordering (CO) does not yield a minimal 
set of such amplitudes. Such a set can be achieved by using the adjoint representation 
instead, cf. [452, 453] for details. 

Another decomposition to coloured amplitudes, known as colour dressing, has 
been proposed in [651, 739]. Based on actual colour flows it treats the SU(N.) gauge 
field, the gluons, as Ne x Ne matrix, thus making the matrix character of the gluon field 
a j= G,, T= in the fundamental representation more explicit than by denoting it 


with an index a in the adjoint representation. Considering a term T“T“%, corresponding 


ij” kD? 
to a gluon exchange motivates why this may be helpful for numerical implementations: 
i l i l 


eee 1 1 
ie = ôitðkj — NT = i TN D A € (3.13) 
J k J k 


This sketch also explains why this is based on colour flows — both terms correspond 
to connecting indices of fundamental SU(N.) objects, with a sum over independent 
colours that yields exactly the (N2 — 1) degrees of freedom present in the gluon field. 
This also means that every QCD vertex can be written as a sum of 6 functions in 
colour space, connecting the quark and gluon colour attached to it in all allowed 
combinations. This allows for a fairly straightforward implementation in terms of a 
computer code, and replacing the potentially cumbersome colour algebra with factors 
of one or zero further accelerates the evaluation of the amplitudes. 


3.2.1.4 Recursion relations 


This idea of recycling identical parts of the amplitude is brought to perfection by using 
recursion relations from the beginning. The basic idea here is to create one-particle 
off-shell parts of the amplitude recursively in the spirit of the Dyson-Schwinger 
equations [493, 840] known from text-books on quantum field theory, where they 
are used to construct one-particle off-shell Greens functions. This very idea has been 
put into effect in different realizations, directly in the HELAC code [308, 486], or in 
some variations such as the ALPHA algorithm [326, 327], on which ALPGEN [743] and 
O’MEGA [770] are based, or as Berends—Giele relations [219—222, 681], implemented 
in COMIX [582]. 
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The idea in all of them is to construct generalized currents Ja(r) for a set m of 
external particles on their mass shell plus one internal one, which then of course may 
be off-shell. In this construction, œ denotes the combined quantum numbers of this 
internal particle, including spin, colour, etc.. As a somewhat special realization, the 
external particles may also be considered as currents; then the quantum numbers a are 
directly identified with the ones of this particle. From these special cases, recursively 
more and more complex currents are built, by applying 


Tam) = Palt) X) SD [8m m2) VE” Ja (11) Jos (12)| 


Pala) vars? 


+ SSS (SG, m2, 3) VE Ja, (71) Fag (72) Tas (m3)] 
Ps(m) vareres 
(3.14) 


In this equation, P,(7) denotes the propagator denominator, which of course depends 
on the properties a of the propagating particle and on its momentum given by the 
momenta in m. The terms S(m1, 72) and (7, 72, 73) take care of symmetry factors, 
which emerge in the partitions Po(7): m —> mı ® T2 and Pa(m): t > Tı ® T2 O73 
of the original set. Finally, the Va signify the three- and four-leg vertices connecting 
the particles a; of the sub-currents with the emerging new particle a. This allows us 
to write the total amplitude A(z) for one specific configuration 7 as 


MOI) Gn an (rlo), (3.15) 


where |p denotes the residual subset of m after the subset p has been taken off. 
Overall conservation of quantum numbers also guarantees that the combined quantum 
number a,,|, of this conjugate subset is the conjugate of the set p, &p. It is worth noting, 
however, that it is computationally advantageous to map the four-vertices onto vertices 
with three legs only. 

For some specific cases it is possible to solve the recursion equation in Eq. (3.14) in 
closed form [220, 692]. For instance, a current with n external colour-ordered like-sign 
gluons is given by 


(ly Fla) 
V2 (q1) (12) (23) ... ((m — 1)n) (ng) 


where the four-momentum PH" j is constructed from the outgoing momenta through 


(3.16) 


PE =N prs (3.17) 


and where q” is the gauge vector. This current can be used, to prove the form of 
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the maximally helicity violating (MHV) or Parke—Taylor amplitudes, which 
correspond to all gluonic, colour-ordered amplitudes with all equal signs apart from 
two, labelled by 7 and j in the equation below. Their form was conjectured for the first 
time in [797] and proven in [220] to be 


MT ORS aah Gee sna” pd awaits nt) = ign? (ij) 
> ry , ; ; ry ) s (12) (23) shane ((n = 1)n) (n1) (3 18) 
+ 74 . 
Ghee ees See Ae eee n7) = igh? lij] 


[12][23]... (n — Dnjin1] 


Note that the all-sign identical amplitudes vanish due to the conservation of angular 
momentum. 


3.2.1.5 On-shell methods 


The study of more general properties of scattering amplitudes, especially in the context 
of the super-renormalizable M = 4 Super-Yang-Mills theory, has experienced a some- 
what surprising renaissance in the early 2000s, leading to a remarkable progress in the 
understanding of perturbative QCD from twistor-inspired methods [897]. Building on 
a correspondence with some well-understood type of string theory, it could be shown 
that colour-ordered tree-level amplitudes can be related to curves in twistor space, 
giving rise to the Cachazo—Svrček-Witten (CSW) vertex rules [307]. Even loop 
amplitudes were shown to follow the same principles and rules [306]. These rules state 
that arbitrary colour-ordered scattering amplitudes can be constructed from MHV 
amplitudes [306, 307, 897]. They serve as generalized MHV vertices and are connected 
by scalar propagators, resulting in a full n-gluon amplitude being built from (n — l) 
same-sign helicity gluons (and l gluons with helicity of the opposite sign) arranged in 
(l — 1) of these vertices. 

As an example, consider Fig. 3.3, displaying the construction of the colour-ordered 
six-gluon amplitude A(1~, 27, 37, 4*, 5t, 6+) from MHV vertices. As a consequence 
of this construction, for any n-gluon amplitude MHV vertices for up to n particles 
may contribute, implying that the number of such vertices that are needed for a 
cross-section calculation grows steadily with the number of external legs. This problem 
has been addressed in [211], reformulating the CSW rules in a fully recursive fashion. 

A further refinement of the CSW rules has been worked out in by Britto, Cac- 
hazo, Feng, and Witten in [286, 287], stating that any colour-ordered tree-level 
amplitude can be constructed from two on-shell amplitudes with a scalar off-shell 
propagator in between them. This yields the BCF recursion relations, which can be 
summarized as 


n—2 
A eo 1 K 7 
An(1, 2, pers n) = 5 Ark+ı(1, 2, meig k, =T) pP, An-k+1 (Pik k+ 1, Aa sf), (3.19) 
k=2 1,k 


where the momentum of the propagator is given by the sum of external momenta, 
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Fig. 3.3 The six MHV graphs for the construction of the six-gluon am- 
plitude A(1~, 27, 37, 4+, 5+, 6*) in the CSW formalism. 


k 
pt, = >_ vt, (3.20) 
i=l 


and where the denotes momenta that are shifted by an amount z, 


Bik = Dik + ZÀnA1 
pr =pitzrnA (3.21) 
Pn = Pn + zàn. 


Here the À and À denote the co- and contra-variant components of the spinors for 
light-like momenta, 


DPEN, (3.22) 
and the shift parameter is given by 
2 
Pik 
z = ——_. 3.23 
(lp. aD a) 


In contrast to the CSW rules this implies that the sub-amplitudes contain on- 
shell particles only, which makes them easier to calculate. They are, however, not 
gauge-invariant objects due to the need of a gauge vector to define the opposite gluon 
helicities, which enter through the shift parameter z. It is interesting to note in this 
context that the shifted momenta pı and p, are complex valued but still on-shell and 
light-like, adding an interesting twist to the calculation. Finally, it is important to 
stress that of course these formalisms have also been extended to include quarks or 
the case of QED interactions. 
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3.2.2 Phase-space integration 


Turning now to the issue of phase-space integration through numerical methods, 
already simple considerations show that traditional quadrature techniques like the 
trapezoidal or Simpson method will not converge fast enough for large numbers n 
of final-state particles.? Instead, the method of choice turns out to be Monte Carlo 
integration, which due to its statistical nature is guaranteed to have an uncertainty es- 
timate that scales like 1/ VN with N the number of phase-space points. The basic idea 
of Monte Carlo integration is to replace the integration of a function with sampling it: 


N 
I= | aos) > UD = BIG) = Ne. (3.24) 
V i=1 


Here # denotes a point in the D-dimensional finite phase space with volume V, and 
the x; are uniformly randomly distributed in it. In Monte Carlo integration the exact 
value of the integral I is estimated as an average over N test points in the volume. 
The central limit theorem guarantees that with infinitely many calls N — oo the 
estimator (I) + I. The name of the game in Monte Carlo integration is to control this 
convergence, through an error estimate (E) that will scale like 1/VN. A convenient 
error estimate is the variance, written in useful form as 


N 


N 2 
EM): => (re) = (50) = |(P)e—(f)- (3-2) 


i=l 


Various methods have been discussed in the literature to accelerate error reduction. 
The most prominent ones are known as importance sampling and stratified sam- 


pling. 


3.2.2.1 Importance sampling 


The underlying idea of importance sampling is to use a mapping of random numbers to 
function arguments 7, the four-momenta in the case of particle physics, which captures 
as far as possible the distribution given by the function, the squared matrix element 
for the process in question. In this method a probability density g(x) for a vector of 
random numbers is constructed in such a way that f(#)/g(z) is better behaved than 
f(z) alone. Then the D-dimensional Monte Carlo integration of f(X) over D uniformly 
distributed random numbers x; becomes 


((f))e = fares) = [Pea f (2) 
= fap BO = D = (7), (3.26) 


IÒ 


3For a summary of Monte Carlo integration methods, see for example the review [648]. 
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where the p; are distributed according to the phase-space density g(Z). If f and g 
are sufficiently similar, their ratio f/g will fluctuate less than f alone, leading to a 
smaller variance and therefore an accelerated convergence. The limiting factor in this 
is obvious: a function g must be found, which has to be inverted in order to generate 
the points p. Furthermore, to keep things simple, the g(Z) must be non-negative in V 
and integrate to unity 


J Pz) = 1, (3.27) 
V 


which can be trivially achieved by normalizing them to their integral over V. 


3.2.2.2 Stratified sampling 


Simply put, stratified sampling aims at improving the integration performance by 
binning the integration region V and by distributing the phase-space points with a 
different density in the bins. The overall integral and the square of the overall variance 
are the sum and the quadratic sum of the single integrals and squares of variances in 
the bins b: 


(E)E = J Eie (3.28) 


The overall variance will be minimized by making the individual variances in each bin 
of equal size. This can be achieved by introducing a different a priori probability a, for 
picking a bin b to generate a phase-space point in it, and by updating them regularly 
after a sufficiently large number of function calls. Bins with larger variance after such 
an optimization step, with more fluctuations, will obtain a larger ay and therefore an 
increased number of points sampling the phase space in this bin; conversely bins with 
smaller variance, less fluctuations of f(Z), will have a smaller a, and fewer sampling 
points. Ultimately, the best theoretical solution is to have a priori weights ay which 
behave like the variance in each bin. Therefore updating them by multiplying them 
with the variance or similar will accelerate the convergence. 


3.2.2.3 Isotropic phase space 


Ignoring for the moment issues related to the sampling over the initial-state parame- 
ters, zı and x2 or, complementarily yems and 8, a logical first attempt at Monte Carlo 
phase-space integration for the final state consists of an isotropic phase-space sampling. 
In such a sampling, for a given centre-of-mass energy squared E2 s = P? = 8, the n 
momenta p} populate the phase space homogenously, but still respect overall four- 
momentum conservation and mass-shell conditions for the individual four-momenta: 


= (3.29) 
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These requirements are incorporated through 6 functions in 


N 
(2n)*654 (> -5 rt) (3.30) 


i=l 


N 


ax =T] i (27)5(p? — m?) O(E;) 


where the O(£;) account for the projection on physical, positive energy solutions for 
the outgoing particles. 

For massless particles, it is fairly straightforward to calculate the volume of the 
final state phase space. As all ingredients in the N-particle phase space of Eq. (3.30) 
are formulated in Lorentz-invariant quatities, in the following P“ = (Eems, 0) can 
be safely assumed. Following [685], first an “unconstrained” phase-space volume y 
is introduced, based on unconstrained momenta q; not fulfilling overall momentum 
conservation like the constrained p;: 


PA A 4g; 
[ean = f | a ante ota) a) = |p | wte 
i=l 0 
1 


Tea 


(27)2N 7 


Here, f(g?) denotes an arbitrary regulator function that keeps the overall volume finite 
— for the choice made here the result is displayed.* In a second step a transforma- 
tion between unconstrained momenta q/' and constrained momenta pi’ is found as a 
combiniation of a scaling operation parameterized by x and a Lorentz boost given by 
b 


ya +b: Gi 
th ys u = eee 
pi = cH’ (q) = 2 S DAT |- (3.32) 


1+7 
Consequently, the inverse transformation is given by 
p lyru 
qG=- H" (pi) (3.33) 
x 


With M = ,/Q? the invariant mass of the unconstrained system, the boost parameters 
b and y and the scaling parameter x read 


p E 
eman = gt eg Rea 


4Integrating over the spatial components of the massless momenta is trivial: the 6 function guar- 
antees that |i] = q? and therefore 


Bai 
(2m)4 


(PEPI a 
1673.99 Ar” 


(21)6(q?) = 
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Inserting these transformations into the unconstrained phase space and after a few 
manipulations detailed in [685], the two phase-space volumes are related with each 
other through 


bs 4 be 35 z 
[ob z P oa fens (z- | 5 (5 z) 
N 4p. (27 2 ; 
Il É pi(2 eee f (+ He)]| (3.35) 


i=l 
d d85 Et N 1 
= Gays { II 1 (42°09) dx) ; 
t=1 


With the choice made for the limiting function f(x), 


ig G Hgo) = exp -7l , (3.36) 


such that only the integration over band x remains, which results in 


en ee (F) BOT E)TO-1TEn) ga 
i =f) mes (27) qa?N+H (an) T(n+35) . 
0 
This fixes the unconstrained phase-space volume to 
E2N-4 
dy = f 3.38 
N  2(4n)2N-3P(N)P(N — 1) (238) 


To turn this into a Monte Carlo algorithm, it is sufficient to generate N uncon- 
strained momenta q}, with isotropic angular distribution and an energy density given 
by q? exp(—q?). Denoting with # a random number uniformly distributed in the in- 
terval [0, 1], the q; are obtained from 


i = 2n-#, cosh; = 1-2-#, ag? = —log(#- #). (3.39) 


They are transformed into the constrained momenta p% through the transformations 
detailed here. This is the RAMBO algorithm invented in [685], where also the general- 
ization to massive momenta has been worked out. 


3.2.2.4 Democratic phase-space mappings for pure QCD processes 


The isotropic phase-space generation introduced above is a good example of a demo- 
cratic approach, where all particles are treated on the same footing. It should be 
intuitively clear that a uniform phase space, while simple, elegant, and transparent, 
will not deliver good results in cases where the actual distribution of final-state mo- 
menta is not uniform, which typically is the case in all relevant applications. There 
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is, however, a class of processes where another democratic approach to phase-space 
integration has been worked out and indeed delivers results superior to the ones pro- 
vided by the RAMBO algorithm above. The processes in question are those with QCD 
only final states, where the leading amplitudes assume a particularly simple form. For 
instance, for purely gluonic processes the most singular amplitudes are the MHV and 
MHV ones, which for a colour-ordered n-gluon process behave like 


n 
AMEVMEY x |] = with ŝn(n+1) = Sea) (3.40) 
ja Sili!) 

cf. Eq. (3.18). This particularly simple form is also recovered in other processes; the 
inclusion of quarks typically just leads to the absence of some of the 1/8 factors, 
therefore rendering the amplitudes somewhat less complicated to integrate. 

A first method that has been optimized for the integration of such amplitudes has 
been introduced in [487] and goes by the name of SARGE. It builds on sequentially 
filling the phase space through emissions that follow the basic antenna density 


1 (pipi) i jk 
dA}, = -d ppp? O (pp) — = g (éi IEN 3.41 
ij = z PROPE (px) (pipe) (Ppr) g (Sij) 9 o ) (3.41) 
where the ( ) 
jk _ \PjPk 3.42 
$3 (pip;) oe 


are the arguments of the regulator functions 


1 


= Doge, OE ~ Em Om — 8), (3.43) 


g(f) 
which ensure that the divergences for (pipk) — 0 and (pjpk) — 0 are avoided and 
are chosen such that dA integrates to unity. The cut-off value ém depends on the 
actual cuts being applied on the outgoing momenta. Demanding that for each pair of 
momenta i and j, (pi + pj)? > so, the expression for Em is, 


E= : (n+ Die + 2) . (3.44) 


where § is the energy squared of the partonic system in the centre-of-mass frame. 

To generate a momentum px according to the basic antenna structure, Eq. (3.41), 
SARGE proceeds along the following steps. The initial momenta p; and p; are boosted 
into their centre-of-mass frame. Then, two numbers i and g are generated with a 
probability density of g(£)/£. They allow the calculation of the energy of particle k, 
p?, and its polar angle 6 with respect to p;. The azimuthal angle is chosen uniformly in 
[0, 27], which then enables the construction of pp in the rest frame of i and j. Boosting 
pr back into their lab frame finishes the generation of one emitted momentum. It is 
worth noting that this algorithm does not respect four-momentum conservation, since 
recoil effects on the emitters 7 and 7 are not captured here. 
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This algorithm is now iterated to yield all outgoing momenta along the following 
lines. Generating two outgoing momenta qi and qn in the centre-of-mass frame of 
the partonic collision triggers the consecutive generation of the (n — 2) other momenta 
through the basic antennae d.Aj,,dA3,, ... dA Gaan already discussed. Using the same 
boost and scaling transformations introduced above for the RAMBO algorithm [685] 
yields the final momenta p;, which indeed, by construction, satisfy four-momentum 
conservation. 

This democratic algorithm can be further improved by intelligent symmetrization 
over colour orderings (outgoing momenta) and better maps, cf. the orginal publica- 
tion [487]. Furthermore, going from a symmetric way of antenna generation to a more 
hierarchical way, as implemented in the HAAG algorithm introduced in [880], leads to 
a further refinement and, consequently, an accelerated convergence of the integration. 
These more specific algorithms are good examples of how improvements in Monte 
Carlo integration can be achieved by employing knowledge about the characteristics 
of the underlying process. In the following this intuitive picture will be substantiated 
a bit more through a more detailed discussion of optimization procedures in the lit- 
erature and the introduction of the by now widely employed method for Monte Carlo 
phase space integration in particle physics. 


3.2.2.5 Hierarchical mappings 


In contrast to democratic approaches with fairly symmetric final states there are of 
course also processes for which the opposite is true: the final-state particles emerge 
from a very specific set of Feynman diagrams which could be interpreted as a se- 
quence of production processes of usually resonant particles followed by their decay 
into lighter particles. A good example of this would be the production of a tt pair 
and its subsequent decay into two b quarks and two W bosons, which in turn decay 
further. In such a case democratic and, in particular, isotropic mappings will not be 
very efficient, since the particles are just not distributed independently in phase space 
but will enjoy strong correlations due to the intermediate resonances. Knowing this 
and understanding the underlying phase-space structure lends itself to the textbook 
Monte Carlo method of importance sampling for efficient phase-space generation. 

In the example of tt production and decay this could be achieved for instance along 
the following steps. Assuming first a useful distribution of the tt pair in terms of its 
centre-of-mass energy and rapidity, which for an e~e* collider could naively be fixed, 
while for hadron colliders PDF effects would have to be taken into account. Then, in 
the centre-of-mass frame of the pair, the top and the anti-top would be isotropically 
distributed, with back-to-back kinematics, and with their invariant masses individually 
given by a Breit—Wigner distribution. Each of the top decays, again, could be treated 
in their respective rest frame, with the invariant masses of the W’s again given through 
a Breit-Wigner form. Of course, this process would be finally repeated also for the 
Ws. 

This implies a hierarchical structure of Breit-Wigner distributed invariant masses 
of the intermediate resonant particles (or a similar distribution for the overall centre- 
of-mass energy of the total system) followed by their binary decays, which in the 
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respective rest frames reduces to choosing one pair (0, ) fixing the orientation in the 
solid angle of the back-to-back binary decay kinematics. 


3.2.2.6 Multi-channel method 


The case of the example above can be further extended also for processes where a 
number of such hierarchies compete with each other. Up to now the best solution for 
such cases can be constructed by combining different mappings in a dynamic form. 
This is known as the multi-channel method, which has been presented for the first 
time in the framework of particle physics in Ref. [682]. Expressed in technical terms, 
this method presents itself as an interesting amalgamation of the traditional Monte 
Carlo integration methods of stratified and importance sampling, where the binning 
of the stratified sampling is not in actual integration parameters or phase-space but 
rather in independent functional forms of individual mappings. In other words, multi- 
channeling performs a stratified sampling over various importance samplings. 

Mathematically speaking, a mapping function g(Z) is constructed as a sum of 
individual mappings g,;(Z), each of them with an a priori weight aj: 


od) = Sa Dasa (3.45) 


As before, the estimate for the integral is given by 


Me = (fie = (f/9)5 (3.46) 


and the error estimate is 


OEN ODEON aa 


While in principle the integral does not depend on the aj, its estimator and, more 
importantly, the error estimator does. This can easily be understood by writing the 
expressions above again as integral. In J the mapping function g cancels, but this is 
not the case in the first term of E: 


V 
> 2 2f 
T 
E= f Pr) (43) -P fst @) _ pp (3.48) 
g(a) g 
V V 
From here it would be trivial to rediscover stratified sampling by using mappings g; (x) 
which are unity inside the bin j and zero outside. 
This, however, is not how multi-channel sampling is being used for the phase space 
integration in scattering processes in particle physics. There, the form of the transi- 
tion matrix element is known. Using, for instance, Feynman diagrams to construct it, 


Technology of next-to-leading-order calculations 117 


one can obtain individual phase-space mappings gj respecting the kinematics of an 
individual diagram, which is mainly driven by its propagator structure. In principle, 
they can be represented by intuitive mappings: a simple pole of the form 1/8 for a 
massless particle, a Breit—Wigner form, defined by the mass M and width I of the 
propagating particle, 1/[($ — M?)? + M?T?], and a t-channel propagator, 1/(¢— m?). 
Similar structures also exist for decays, for example isotropic two-body decays of a mas- 
sive particle, or anisotropic decays to capture, for example, the emission of a gluon 
or photon in the final state. The individual mappings are then composed by a se- 
quence of decays and propagators, basically a hierarchical mapping for each diagram. 
This is fairly straightforward. What is typically not clear, though, is which specific 
phase-space configurations dominate the behaviour of the cross-section, especially in 
the presence of non-trivial cuts on some of the outgoing momenta. In such a setup, 
when a suitable linear combination of the most important mappings is hard to find, 
importance sampling loses its power. Then, the stratified sampling part of the integra- 
tion does the work and automatically finds a well-suited integration setup. A variant 
of this technology, coined single-diagram enhanced (SDE) multi-channel phase-space 
integration has been introduced in [740]. 

It is worth noting that this technology can be further refined, for example by 
using stratified sampling in the phase-space mappings first to find the most efficient 
combination of them, and by then using, for instance, VEGAS [725] to increase the 
efficiency of the “best” channels. 


3.2.3 Automated leading-order tools 


The methods presented in Sections 3.2.1 and 3.2.2 form the core of a number of fully 
automated tools which allow the generation of events at the parton level. Such tools 
are sometimes also called “matrix element generators”. Their performance depends 
mainly on two construction choices, namely 


e on the methods used for the evaluation of the matrix elements squared: the con- 
struction of Feynman amplitudes, traditional squaring, and using completeness 
relations, or translating the Feynman amplitudes into helicity amplitudes, or 
abandoning the language of Feynman diagrams altogether and using on-shell or 
off-shell recursion relations instead; and 

e on the methods used for phase-space integration and, possibly, the treatment of 
colour degrees of freedom. 


In Table 3.2 publicly available leading-order tools are listed and roughly categorized. 


3.3 Technology of next-to-leading-order calculations 


When promoting the calculations at leading order discussed in some detail in the previ- 
ous section to next-to-leading order accuracy, the master formula for the evaluation of 
production cross-sections of an n-body final state in hadronic collisions, cf. Eq. (2.52), 
will be altered. This is because the partonic cross-section, 6 now receives contributions 
not only from Born-level matrix elements but also from virtual and real corrections. 
Therefore, at NLO 
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Table 3.2 Publicly available tools for leading—-order cross-section calculations at hadron 
colliders. For the matrix element part of the evaluation different methods are being used: 
off-shell recursion relations in different algorithms, denoted by “off-shell”, Feynman diagram 
based helicity amplitudes, denoted by “hel. amps”, and the traditional method based on 
squaring the amplitudes and using completeness relations for external particles and traces over 
the Dirac algebra, denoted by “traces”. Related to this, different methods for the treatment 
of coloured particles are also listed, in particular the direct evaluation of the colour algebra 
(“explicit”) and the method of colour dressing (CD) or colour connections (CC) supplemented 
with a sampling over colour and helicities. For the phase-space integration, various versions 
of automated multi-channel sampling methods are being used (“multi”), in addition to some 
more process-specific solutions. 


ME: |Mapn/" PS: d®y 
colours & helicities 

ALPGEN [743] off-shell [326, 327] process-specific 
explicit 

AMEGIC++ [696] hel. amps [197, 684, 741] | automated multi [682] 
explicit 

COMIX [582] off-shell [220] recursive multi [299] 
CD [651, 739] & sampling 

COMPHEP [266]/ traces specific single-channel 

CALCHEP [210] explicit 

HELAC/ off-shell [651] automated multi [795] 

PHEGAS [308] CC [869] & sampling 

MADGRAPH [150] hel. amps [773] automated SDE multi [740] 
explicit 

O’MEGA/ off-shell [327, 328] specific [789] 

WHIZARD [677, 770] || explicit 


1 
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(3.49) 


where the various contributions — Born term B, virtual correction term V, and real 
correction term R — are given by suitably helicity-summed or averaged matrix el- 
ements. Denoting the perturbative order of the Born-level contribution with b, and 
indicating the orders of the matrix element M accordingly, therefore 
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At this point it is important to stress that the Born-level and virtual matrix elements 
are multiplied to yield the overall virtual correction. Thereby, the latter matrix ele- 
ment emerges from the former by adding a closed loop without changing the external 
particles of the process ab + n. Both matrix elements share the same phase space, 
essentially an n-body final-state phase space, ®g, see below. In contrast the real cor- 
rection emerges as the square of matrix elements with one additional outgoing particle; 
this may lead to replacing an incoming particle type a or b with a’ or b’. For instance, 
an incoming quark a may be replaced by a gluon, which splits into an outgoing anti- 
quark and a quark a such that for the process in question ab — n is replaced by 
gb > n + ā. With this change also the PDFs must change accordingly. 

Turning to the phase-space elements, the expressions for the Born-level and real 
correction phase space are given by 


1 
d®g = drqdty fajn (La, UF) S/he (Xb, be) as d®,, 


(3.51) 
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28 arp 


with 


ana = [T Bs Crag? — m) ema! (ntn-Z n) Q(B), (852) 
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cf. Eq. (2.55). 

As already discussed in Section 2.2.5, there typically are additional complications 
when evaluating cross-sections at higher orders beyond merely having to calculate 
more and possibly more complicated diagrams. First of all, the loops in the virtual 
contribution introduce a new integration variable, the four-momentum in the loop l, 
which is not constrained by overall four-momentum conservation or on-shell conditions 
encoded in the 6 functions, but instead can take any value in any of its components 
allowed by four-momentum conservation in the interaction vertices. In particular, the 
four-momentum in the loop can become infinitely large; for expressions of the general 
form d‘41/l™ this will potentially yield infinite results if the power of the loop momen- 
tum m < 4. Such ultraviolet divergences have to be regularized, which usually is 
achieved by employing dimensional regularization, corresponding to integrating in 
D =4-—2e rather than in four dimensions. The benefit of this method is that it can be 
formulated in such a way that Lorentz and gauge invariance are guaranteed which is 
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not the case for other, traditional methods such as a regularization through a cut-off 
or through the Pauli—Villars method. In dimensional regularization the ultraviolet 
divergences manifest themselves as simple poles 1/2. Having thus “quantified” the de- 
gree of divergence the theory is renormalized through a suitable, scheme-dependent 
redefinition of the Lagrangian, adding suitable counterterms. Reviews of this formal- 
ism may be found in most of the many excellent textbooks on quantum field theory. 
However, some generic methods to evaluate the new structures introduced by the loop 
integration are discussed in Section 3.3.1. 

However, in addition to ultraviolet divergences infrared divergences appear, both 
in the virtual and the real correction. In both cases they are related to emissions, 
either in the loop in the virtual correction or in the additional particle radiated in the 
real correction, where the additional particle has zero energy or is parallel to another 
particle. This has already been discussed in Section 2.2.5, where it was also pointed out 
that due to the Bloch—Nordsieck (BN) and Kinoshita-Lee—Nauenberg (KLN) 
theorems [257, 678, 724] these divergences need to cancel each other in physically 
meaningful observables. 

Again, in order to deal with the infrared divergences, they have to be regularized, 
with dimensional regularization as the method of choice, as in the case of ultraviolet 
divergences and for the same reasons. This time, however, the divergences manifest 
themselves as double or single poles, 1/e? or 1/e, respectively. As already implied, 
these poles do not need to be renormalized since they will cancel in the total result. 

There is a practical problem in the cancellation, though. Inspecting Eq. (3.49), 
these poles show up in two different parts of the calculation, and in two in principle 
independent integrations: for the infrared divergences in the virtual contribution, they 
appear in the integral over the n-body phase space associated with Born configura- 
tions, while for their counterpart in the real contribution, they of course appear in the 
(n+1)-body final state. Even if these integrations could be performed analytically, they 
are cumbersome to trace in D dimensions. This will be exemplified in Section 3.3.2, 
where NLO corrections to W production are discussed. In the more general case, as al- 
ready noted the phase-space integration has to be performed numerically, with Monte 
Carlo methods. In this case it is impossible to integrate in D dimensions, and other 
methods have to be found to isolate the divergences before the integration can be 
successfully achieved. Typically this is by now achieved through subtraction meth- 
ods, introduced in Section 3.3.2. A general subtraction method, Catani-Seymour or 
dipole subtraction [344, 353], is discussed in Section 3.3.3. 


3.3.1 Evaluating virtual corrections at one-loop 
3.3.1.1 The simplest one-loop amplitude: The Drell-Yan process 
In order to understand some of the subtleties of one-loop calculations, it is instructive 


to return to the simplest case, Drell-Yan production, and go through the calculation 
in some detail. 
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The formula for the virtual amplitude is, cf. Eq. (2.116), 


D vp 
ME iya = owen f E {5 oloa [r E yet Beak oy 888) 


k2 (pa Fk)! (pu =k” 
Pat# Pa uL üb Bu Pu —f ü € 
1% atk a? TY Y pu kF | (Pu) pw} 


The contribution of the vertex correction, represented by the first term in square 
brackets, is in fact the only lengthy calculation that needs to be performed. This term 
can be written as 


D + 
E ap f ak — Vee, (W+) ; 
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where the current V“ is 


VE = 0(pa) w Ba +H)” Wu — H) ulpa). (3.55) 


This expression is readily simplified with a little y-matrix algebra to the form 


V” = v(pa) |-2 (Bu — H)? Ba +H) + 2a (Pg +H)” (B,, — H) fu(pu). (3.56) 


This equation introduces a constant, aCP®, that specifies the variant of the regular- 
ization scheme to be used in the calculation. In conventional dimensional regu- 
larization all quantities are continued into D dimensions such that yy” = D = 4—2e. 
This corresponds to the choice a@P® = 1. In dimensional reduction, only the 
loop momentum is continued into D dimensions, and then y9” = 4 implying that 
aCPR = 0. 

In order to evaluate the loop integral, it is simplest to combine the loop propagators 
in Eq. (3.54) by introducing Feynman parameters, here x, y, and z.° Applying the 
identity Eq. (A.27) to the case at hand yields, 


1 2ô(1 — x -— y -— z) 
2 EZEZ -|/ dz dy dz 2 2 213 
(pa + k)? (pu — k) 0 [c(pa + k)? + y(pu — k)? + zk?] 
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where one of the Feynman parameters has been eliminated immediately using the 
6 function. In a second step, the denominator has been simplified with the on-shell 
conditions for p, and pg. It is now useful to shift the variable of integration from k 
to € with the relation, k = €— xpa + ypu. After this shift the denominator takes the 
simple form (¢? + Q?xy)?, where Q? = 2pq- py as usual. 


5Feynman parameter identities are discussed in Appendix A.1.3 
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At first glance, performing the same shift on the current V“ leads to a proliferation 
of terms. However, since the denominator is an even function of £, any odd powers of £ 
may be dropped in the numerator since they will vanish upon integration. Moreover, 
the integral of the term €°@° must be proportional to g°° and contracting with Jab 
fixes the overall constant. It is therefore sufficient to replace 


£008 — g% 02 /D, (3.57) 
under the integral. Finally, the equations of motion for the spinors, U(pa)pa = pulpu) = 


0 greatly simplify many of the resulting expressions. 
After this simplification the integral takes the form 
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where, 
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N =Q [(1—2)(1—y) — ea PR ay] — P (1 — aS PRe)(1 — 2/D). (3.59) 
As a consequence of the spinor structure the vertex correction is proportional to the 
leading order amplitude. The integrals over the loop momentum are now easily per- 


formed by using a Wick rotation, for which the general result is given in Eq. (A.33). 
The integrals appearing here are 


d?e e i D 52 a 
J (2n)P (2+ Q2ay)8 (PA (5 ) r(e) [-Q*ay] 


a?e 1 i 1 PE a 
| ewe (24+ Q2xy)3 ~ — (4n)P/? G) Pikea] 


Hence, 


mP (+ a = aP -@) * 


x {2 [(1 — x)(1 — y) — ca PR ary] ay 7!-* — 2(1 — a PRe)(1 — e) Lge) 


Í are 4N (1 +e) 
( 


The integral over y can now be performed in a straightforward manner. The remaining 
x integral is immediately in the form of a beta function that can in turn be expressed 
in terms of gamma functions, cf. Eq. (A.6). After manipulating these and dropping 
terms of order € the final result for the integral is 
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Í dk ve 
(27)? k?(pa + k)? (pu — k}? 

-QETA +M- E) [2 3.7, cor 
(m) TU- 2) E eee | 


4n\* 1 1 AE: 
aonan uL CDR 
iU(pa) Y" u(pu) ( =) le? 0A | PRN k 


In order to arrive at the last line it is necessary to use the relation in Eq. (A.4) to 
simplify the combination of gamma functions that naturally occurs. 

Restoring the overall factors present in Eq. (3.54) and extracting the leading order 
amplitude, the result for the vertex correction contribution to the amplitude is 


= ö(pa) "4 u(pu) 


E 
(vertex) __ (0) Qs Any? 1 2 3 CDR 
Muaswt = Muaswe Cr ( a) face a ee | 661) 


It is common to identify the factor 
cr = (4r) /T(1 — £) = 1 +e (1 + log(4r) — yg) + O(e?), (3.62) 


which is a universal factor in one-loop calculations. In addition, for the Drell-Yan 
process, the kinematics require Q? > 0 so that it is also necessary to expand the factor 


(—1)€ = 1 + ire — r’e?/2 + O(e°). (3.63) 


Putting this together yields the final expression 


2 E 
mE = MO Burgi a cr 


ud=>W+t ud>Wwt Ar Q? 
2 2 
x | 5 3 _7_ gCDR 4 9? in ( +3)| (3.64) 
€ € € 


This amplitude enters the cross-section in the virtual correction term V as 


ud—+Ww+ ud—>Ww+ 


V = ame [MG m*® |; (3.65) 


cf. Eq. (3.50). 

In conventional dimensional regularization, with a©P® = 1, this thus leads to the 
result for the full virtual contribution already given in Eq. (2.117) in Section 2.2.5. 
To complete the derivation it is therefore only necessary to demonstrate that the 
contribution of the self-energy corrections is zero. Repeating the same steps as above 
for the self-energy contributions, the result is proportional to the value of the integral, 


de il 
| oa e eee 
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Fig. 3.4 A pentagon diagram entering the loop amplitude for Z + 2 jet 


production (left) and the corresponding et e~ — 4 jets diagram from which 
it can be obtained by crossing (right). 


However, since this integral has dimension D — 4 but contains no dimensionful quanti- 
ties with which to express the result, it must vanish. As a result, self-energy corrections 
on massless external lines are zero within dimensional regularization. 


3.3.1.2 General approach to one-loop amplitudes 


Although the previous example illustrates the basic concepts of loop-integral calcula- 
tions, this technique for evaluating the amplitude does not extend to more complicated 
cases. The calculation is particularly simple for a number of reasons. The one-loop am- 
plitude is represented by a single diagram that contains three external legs, all of the 
propagators are massless and as a result the calculation is relatively simple. For pro- 
cesses involving more particles in the final state the number of diagrams, and their 
complexity, grows substantially. For instance, the calculation of V + 2 jet production 
at NLO requires the evaluation of 5-point diagrams such as the one shown in Fig. 3.4 
(left). In general, loop diagrams often are referred to as n-point diagrams, where n 
is the number of loop propagators. Alternatively, a nomenclature referring to them as 
tadpole (1-point), bubble (2-point), triangle (3-point), box (4-point), pentagon 
(5-point), or hexagon (6-point) diagrams, is often used. The example depicted in the 
figure would therefore constitute a pentagon diagram. 

One might expect the computation of diagrams involving such loops of virtual 
particles to be the limiting factor that determines the complexity of calculations that 
can be performed at NLO. Indeed for a long time this was the case, and the efficient 
treatment of loop diagrams continued to be the bottleneck of NLO calculations for 
multi-particle processes. The nature of such calculations is exemplified by the compu- 
tation of a single one-loop diagram that contributes to the process of V+jet production 
at hadron colliders, depicted in Fig. 3.5. In this example all momenta are labelled as 
outgoing, which is a particularly convenient notation since momentum conservation 
is represented by a simple sum, pı + p2 + p3 + p4 = 0. This of course implies that 
p4 = —pı23 = —(pi + p2 + p3) in obvious nomenclature. With this very symmetric 
notation it is also easy to manipulate the matrix element to obtain the result for any 
crossed process with some other external particles in the initial state. Basically, 
incoming and outgoing particles are related to one another by conjugation which ef- 
fectively amounts to the inclusion of complex phases (“barring”). This means that 
computing an amplitude with an outgoing particle of a given chirality also provides 
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T pi 


—> —P123 


J p3 


Fig. 3.5 Example one-loop diagram contributing to V+jet production. 
All external momenta are outgoing and the loop momenta labelled on 
the internal line flow clockwise around the diagram. The arrow on the 
fermion line indicates the direction of particle flow, so that pı represents 
an outgoing quark. The dashed blue lines indicate propagators that may be 
put on-shell when computing integral coefficients using the OPP procedure 
of Section 3.3.1.3. 


the corresponding amplitude for an incoming anti-particle of the opposite chirality. 
For some of the more technical details of this procedure the reader is referred to 
Section 3.2.1. 

Hence, as written, the diagram in Fig. 3.5 corresponds to the amplitude 


0 — q(p1) + g(p2) + (p3) + V (—p123), (3.67) 


but may equivalently also describe 


V (p123) > q(pi) + g(p2) + g(p3), (3.68) 
@(—pi) + a(—ps) > g(p2) + V (—p123), (3.69) 

or 
q(—p1) + g(—p2) > q(p3) + V (—p123). (3.70) 


The behaviour of this crossing operation, when applied to the leading-order ampli- 
tudes, can be verified in Eqs. (2.97) and (2.98). Note that, similarly, the amplitudes 
for pp + V + 2 jet production, cf. Fig. 3.4 (left), can be related to those for four-jet 
production in e*e— collisions, cf. Fig. 3.4 (right). 

The matrix element corresponding to the diagram in Fig. 3.5 reads 
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de f {+ Bro 1 
los alpi) In Ge Yo Ot pina)? 3) yet pa (3.71) 


x VE”P(L + pı, 8 — p12, p2) €p(p2) J7, 


where J° represents the external vector boson current and V“”? is the triple gluon 
vertex factor, 


Vere (e+ pi, —£ — p12, p2) = (20° + 2p? + ph)g” — (C + Qe + pi')g”? 
+(pg — pi — C )g”’. (3.72) 


Inspecting this expression, the amplitude can be described in terms of a basis set of 
loop integrals, categorized according to the number of loop-momentum factors that ap- 
pear in the numerator. This number is referred to as the rank of the tensor integral 
and integrals without any additional numerator factors are usually called scalar inte- 
grals. One way of evaluating this contribution, for a long time the standard method, 
is to separately tabulate integrals of each rank. Contracting the Lorentz structures in 
the numerator with external momenta and polarizations produces the matrix element 
in Eq. (3.71) Inspection of Eq. (3.71) reveals that, for the case at hand, tensor integrals 
of up to rank 3 are required, 


p 1, L”, LLP LPO 
le s es i (3.73) 


Qn)P (L+ pi)? (l+ pia)? (l + pias)? 


For the simplest — the scalar — integrals, numerical libraries for their evaluation are 
now widely available for up to 4-point functions [499, 879, 881]. In fact these libraries 
are sufficient for all one-loop calculations since the more complicated scalar integrals, 
pentagons and beyond, can be written in terms of linear combinations of scalar box 
integrals. However the integrals with additional tensor structure are more complicated 
to evaluate. A number of systematic methods for reducing them to the form of scalar 
integrals do exist, the most widely used of which is the Passarino and Veltman 
reduction method [798]. The basic method can be illustrated by considering the 


integral 
D 
I” a a - , (3.74) 
(27)? P(E ++ pi)?( + p12)? (£+ p123)? 


Since the integral I” can only depend on the momenta that appear in the denominator, 
the Lorentz structure can be decomposed as 


I" = pt Di + p$ D2 + p$ Ds. (3.75) 


The problem is then immediately reduced to determining the coefficients D1, D2 and 
D3. Contracting I” with each of the external momenta in turn gives a system of three 
independent equations that can be used to solve for the three unknowns, 


I-py O pi: p2 pi-p3 Dı 
I-p | = | pi-p2 0 p2:p3 Də |. (3.76) 
I- p3 pı: P3 p2°p3 O D3 
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Now, for example, contracting each side of Eq. (3.74) with p3, results in 


ise =;/ de [(€+ p23)? — L — vias] — [(¢ + p12)? — 2 — pio] 
-p3 = 


2 J (2m)? L(L++ p1) (l+ pro)? (C + p123)? 


_li d?e 1 1 
=3/ ome (C+ pi)2(€+ pie)? (E+ pr)?(E + Pras)? (3.77) 


2p12 ` P3 
L(L++ pi)?(€+ piP ++ piz) | 


The quantity I - p3 is thus expressible in terms of a combination of scalar integrals, as 
are Í - pı and I - p2. The complete integral I”, and in a similar manner I#”, [#”?, ... 
can all be written in terms of scalar integrals in this way. For rank 2 and beyond, the 
Lorentz structure must also include the metric tensor. For instance, the most general 
rank 2 box integral can be decomposed as 


e J pip Du + > (viv; + pho) Dij + 9" Doo. (3.78) 
i ij 


The complication in this approach stems from the inversion of the matrix relation 
in Eq. (3.76). The determinant of this matrix that is introduced in solving for the 
D;, the Gram determinant, is an artefact of the computation introduced by the 
expansion in terms of scalar integrals. The matrix element itself contains no singularity 
in the limit in which these determinants vanish, yet one inverse power of a determinant 
is introduced for each loop momentum factor present in the numerator of the original 
integral. This redundancy leads to expressions for individual Feynman diagrams that 
can not only be very lengthy but also be affected by significant numerical cancellations 
between terms. 

It is possible to reformulate the reduction in a number of ways in order to alleviate 
problems such as Gram determinant singularities. To see how one such solution works, 
consider a triangle integral with p? 4 0 and pł 4 0. The basic scalar integral is, 


d? gr 


Copp) = | oop DES EET (3.79) 


and the rank 1 tensor integral C” can be decomposed in similar fashion to the box 
example above, 


i dPe ou 2 o 
AE (27)P €2(€+ p1)? (l+ pia)? Cipi + Capa: (3.80) 


The matrix equation for the reduction reads, cf. Eq. (3.76), 


C -pı pi E & 
= 3.81 
es a p Caj’ (3:81) 
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and the explicit expressions for the quantities that appear on the left-hand side are, 


Cop = ; [Bo(p12) — Bo(p2) — p{Co(p1, p2)] , (3.82) 
C- pa = 5 [Bolp1) ~ Bo(pia) + @? — p?2)Co (p1, P2) , (3.83) 


in terms of the scalar bubble integral Bo(q) which is defined by 


dPe 1 
Bo(q) - | op EEA (3.84) 


The direct solution of Eq. (3.81) is, 


Cı 1 p3 En oa 
= — F 3.85 
a n pi C- po (aaa) 
where A = p?p2 — (pı - p2)? is the Gram determinant. Rather than following this path 
1P2 


to the solution, note that multiplying the top and bottom rows of Eq. (3.81) by p3 
and (—p, - p2) respectively, and adding, yields the equation, 


(C - pi) p2 — (C - p2) pı - p2 = [pips — (pi - p2)°] C1 = ACh. (3.86) 
From the explicit solutions for C - pı and C - pọ given in Eq. (3.82) this equation 


becomes, 


1 
AC, =—-5; [Pips + Pi - P2(P} — Pj2)| Co(pı, p2) + {scalar bubbles} , (3.87) 
where, for brevity, the exact form of the linear combination of scalar bubble integrals 
has been suppressed. After rearranging this yields an expression for the scalar triangle 
integral, 


2 
a [a Cı + {scalar bubbles} |. (3.88) 


Co(p1, p2) =. 
D1 + po(p2y — pi) — pips 


This equation indicates that, in the limit that the Gram determinant vanishes, the 
scalar triangle integral can be written as a sum of bubble integrals. Note that the 
same conclusion could have been reached more directly by noting that, in this limit, 
pı and pə are collinear and that the correct expansion of C should therefore be, 


CH = Crp! (3.89) 


However, the advantage of Eq. (3.88) is that it provides the O(A) correction to this 
relation, if the quantity C1 is known. The same pattern is reproduced at higher-tensor 
ranks so that, for instance, 
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2 
Ci = 


= A Qij Ci; + {up to rank 1 bubbles, Co}|, (8.90 
a GP) PR D OE en 


i,j 


where Cj; are rank 2 reduction coefficients and so on. This means that if the rank r 
bubble integral is already determined, the rank r triangle integral can be determined 
up to corrections of order A. This suggests an iterative approach to the problem, that 
begins by using Eq. (3.88) with A = 0 to determine the first approximation to Co. 
With this in hand, Eq. (3.90) can be used with A = 0 to determine C; for the first time. 
At this point Eq. (3.88) can be used once more to refine the value of Co, which is now 
accurate to order A. This procedure involves denominators such as the one shown in 
Eq. (3.88), which in general do not vanish at the same time as the Gram determinant. 
However, it may be that the coefficients appearing in this iterative scheme are such 
that the procedure does not converge. Alternative reduction methods to handle such 
exceptional cases, together with explicit algorithms for reductions of up to six-point 
integrals, are given in Ref. [456]. 

Despite the existence of libraries implementing the sort of rescue procedures de- 
scribed above, this approach still becomes more cumbersome as the number of particles 
in the final state increases, simply due to the rising number of Feynman diagrams that 
must be computed. Nevertheless, such methods have been pushed to their limits in the 
build-up to the LHC era, providing NLO predictions for final states such as bbbb [253], 
W+W-~bb [459] and tébb [242, 284]. 


3.3.1.3 Reduction at the integrand level: OPP method 


A very powerful method for obtaining one-loop amplitudes relies on a different re- 
duction technique that works at the integrand level and naturally takes advantage of 
the powerful leading-order amplitude techniques discussed previously. To see how this 
works, consider again the V+jet process represented by Eq. (3.67). 

This process depends on three independent four-vectors, pı, p2, and p3. In what 
follows it will be convenient to parameterize a general four-vector — the one that 
defines the momentum running round the loop — in terms of the vectors that lie 
in the physical space spanned by pj, p2, and p3, and the transverse space that can be 
spanned by a single vector, n. For the physical space, rather than using the momenta of 
the particles themselves, it is more convenient to use the van Neerven—Vermaseren 
basis, v1, v2, v3, defined by 


kak kı uks ki kop 
hb ky kok3 H ki kokg H kık2k3 
v = ’ Ug = A ’ U3 A ’ (3.92) 


A 


where kı = pi, k2 = pı + p2 and k3 = pı + po + p3 are the momenta that naturally 
appear in propagator factors of loop diagrams. The Kronecker delta appearing here is 


6To show this identity requires some further work to demonstrate that the coefficient Coo can be 
written in terms of bubble integrals and a scalar triangle, plus rank 2 triangle coefficients of order A. 
This is indeed the case and, for instance, one can deduce the relation, 


4AC22 + 4(D — 1)p?Coo + płCo(p1, p2) = {bubbles} . (3.91) 
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a shorthand notation with the following definition, 
ki ká key 
Oki koky = | KI k3 k3 |, (3.93) 
ke KÊ ke 
and the factor A is the corresponding Gram determinant, 
A =S. pa kiukovksp = Ôk pek. (3.94) 
These momenta satisfy the orthogonality condition 


The vector n in the transverse space can be defined by 


i eukikaks 
nt = ——, 3.96 
VZN ee 
where the epsilon tensor ensures orthogonality 
Ni kj = ni: Vj = 0, (3.97) 


and the normalization is such that n? = 1. The properties of these vectors make it 
straightforward to see that a four-dimensional loop momentum can be expanded as 


3 
oH = S (6+ kiju} + (C n)n". (3.98) 


i=l 


This is a particularly useful form for the following reason: the diagram shown in 
Fig. 3.5 contains four propagators do,...,d3, given by (assuming massless quarks) 


dg =, di =(€+)?, for i= 1,2,3. (3.99) 


The dot products that appear in the loop momentum decomposition can thus be 
re-expressed as 
1 1 
l- ki = 5 (di — do) — uk (3.100) 
The most complicated term in the evaluation of this diagram contains three powers of 


the loop momentum in the numerator and can be treated by realizing that 
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3 
WUE = Tee (Za — do — k?)u? + (£- nn’) 


i=l 


3 
1 Vv V 
= 3% | È (t — do — Koy + (C n)n 
j=1 
3 
x (- 5 keue + (L. ny + triangles 
i=1 


t 


3 3 
(- 5 k2 vit, + (e- ny) — 5 kev + (€-n)n” 


j=1 


3 
x (- 5 k2ue + (4 ny + triangles, bubbles 


i=1 
= OHP + HYP (L. n) +85. (E- n)? + dk”? (£- n)? + lower points. 


In other words, the integral can be manipulated into a very particular form. It can 
be represented by coefficients of a scalar box integral (ôo) and tensor integrals that 
are still of rank 3, but where the loop momentum is contracted with the transverse 
vector, n. The remaining terms are all integrals in which at least one propagator has 
been cancelled. 

One crucial simplification remains. Contracting the loop momentum in Eq. (3.98) 
with itself yields the relation 


3 
C= So (E+ ki)(E- ky)us vy + (En)? 
ee d (3.101) 
1 
> (l-n)? = do — = 5 (di — do — k?) (d; — do — k?)u; + vj 
i, j=1 


This means that a factor of (¢-n)? in Eq. (3.101) may be replaced by a constant, up to 
terms in which a denominator has been cancelled and that the results thus represent 
lower-point integrals. Redefining 69 and 6, suitably to absorb the additional constant 
terms, the rank 3 integral can thus be parameterized most simply by 


cee gr — 5h”? + ot”? (L n) + lower points (3.102) 


After integration, the ¢-dependent term cannot contribute to the result since the inte- 
gral will only produce contributions of the form k;: n = 0, thus vanishing by definition. 
Therefore the rank 3 box integral has been reduced to a scalar box integral, together 
with lower-point integrals of rank at most 2. 

A similar line of reasoning can now be followed for the lower point triangle and 
bubble, integrals. In these cases the parameterization of the loop momentum becomes 
more complicated since additional transverse directions are required in order to span 
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the four-dimensional space of loop momenta. However, the basic line of reasoning 
follows and, in similar fashion, these integrals can also be reduced to their scalar coun- 
terparts. This allows the entire diagram to be reduced to a simple sum of coefficients 
multiplied by basic scalar integrals at the integrand level 


{ULP Awp +...} a a) aa 
D — = L a Wath, s 
dod, dod3 Cader 2a I3 + 2 I; (3.103) 


In this equation the n-point scalar integral is denoted by O, where the box prop- 


agators that have been cancelled are labelled in the superscript. This decomposition 
is at the heart of the approach to computing one-loop amplitudes that was originally 
formulated by Ossola, Pittau, and Papadopoulos [792], now commonly referred to as 
the OPP method. 

It is convenient to rewrite this decomposition one more time, with the cancelled 
propagators in the lower-point integrals explicit, 


1 (i) Gij) 
D => ——— 4 ý Zar . . 
aids c4 + i c3” di + J cy” did; (3.104) 


tj 


If a particular loop momentum could be chosen such that d; = 0 for all 7, then all the 
terms on the right-hand side of Eq. (3.104) would vanish except for c4. Therefore the 
coefficient c4 can be determined by evaluating the diagram for this special value of 
the loop momentum. The form of the loop momentum that satisfies these additional 
constraints is already determined by Eq. (3.98), Eq. (3.100), and Eq. (3.101). Setting 
all the propagators to zero in these expressions, the loop momentum reads 


1 1 
mas So ke uf + 5 NO kk agg | në. (3.105) 


Evaluating the expression for the diagram, D, at this particular value of 4” yields 
the box integral coefficient c4. With this coefficient determined, Eq. (3.104) can be 
rewritten as 


Ca 1 (i) Gij) 
= di D didy 3.106 
uda ae e 3 w eee) 


to clarify the strategy for determining the triangle coefficients. Another parameter- 
ization of the loop momentum is required, which differs in important respects from 
Eq. (3.98). Importantly, for the triangle case, there are only two independent momenta 
so that spanning the physical space requires two transverse directions nı and ng, 


2 
œ =X (0 kijut + (C-n )ni + (E na)nh, (3.107) 


i=l 


Technology of next-to-leading-order calculations 133 


where vı and v2 are now redefined appropriately. Despite this difference, the essential 
strategy remains the same: use an appropriate parameterization of the loop momentum 
that allows the reduction of tensor integrals to the scalar case, then demand that it 
also satisfy d; = 0 for the three propagators that appear in the integral. With this 
choice, choosing for instance dı = dz = d3 = 0 in Eq. (3.106) singles out the coefficient 
W., Repeating this procedure, at the level of bubble and tadpole coefficients, where 
appropriate, eventually leads to values for all of the integral coefficients appearing in 
Eq. (3.104). For further details, the interested reader is referred to Refs. [501, 792]. 

At this point two important points must be stressed. The first is that by setting 
various combinations of propagators to zero, the integrands actually reduce to com- 
binations of on-shell tree-level amplitudes. Combining this approach with the efficient 
methods for computing such amplitudes discussed in the previous section has yielded 
powerful new tools for computing NLO corrections in an automated fashion, far be- 
yond the limits of purely analytic calculations. These techniques are equally adept at 
handling massive particles propagating in the loop and the total particle multiplicity 
is limited purely by computing power. Examples of such numerical codes are listed 
below. 

The second point is that the discussion presented above neglects an important 
complication. The discussion was rooted in four dimensions, which is sufficient to 
determine all of the contributions proportional to scalar integrals. However, when 
working consistently in d = 4 — 2e dimensions, the algorithm presented so far must 
be extended. Explicitly, the triangle loop momentum decomposition is now described 
not only by the transverse vectors nı and nz but also by the unit vector in (—2e) 
dimensions, Nne, 


th = S (U kiyo} + (E+ nn + (E no)nh + (€- nent. (3.108) 


i=l 


While at first glance this seems like a rather small change, it results in an expression 
for the decomposition of a general tensor integral that is more complicated than the 
form given previously. In particular, the relation analogous to Eq. (3.101) is, 


1 3 
(l-n)? + (L ng)? + (€- ne)? = do — i XO (di — do — k?) (d; — do — k? )vi vj (3.109) 


ij=1 


so that the reduction of a triangle integral of rank at least 2 leads to integrals with 
(L - ne)? numerators. The contribution of such integrals is straightforward to evaluate 
by Passarino—Veltman reduction, 


(l-n 
o re) _ [GuvCoo] = (-22)C 3.110 
[PEPE = mint [gy Coo] = (-22)Con (3.110) 
where most of the components of the integral vanish by orthogonality, but the term 
proportional to the metric tensor, Coo, survives (cf. the equation defining the box inte- 
gral counterpart of Coo, Eq. (3.78)). Since Coo contains an ultraviolet 1/e singularity, 
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this integral thus gives a non-zero contribution. These additional terms are called 
rational parts and additional work is required to determine their contributions. Al- 
though this is an important aspect of the procedure, it is not conceptually different 
from the strategy outlined here and does not require much more computational effort. 


3.3.1.4 Reduction at the integrand level: OPENLOOPS 


The idea underlying the OPENLOOPS algorithm [338] is to combine tensor integral 
reduction and OPP methods to generate one-loop amplitudes based on a recursive 
construction of Feynman diagrams. To see how this works in more detail, consider the 
colour-stripped n-point one-loop diagram 


dg N (Tn; 
ees DD, nA oe, 
which connects the n-ordered subtrees 
Dye iia aien ta} (3.112) 
with n loop propagators 
Di = (q+ pi) — m? + ie. (3.113) 


All other propagators, the ones in the subtrees, as well as all numerators of the prop- 
agators and all structures from vertices and external particles are absorbed in the 
overall “numerator” M, which effectively is a polynomial in the loop momentum q. In 
gauge theories, its rank R usually cannot be larger than n: R < n, which allows to 


write 
M1 H2---Hr 


R 
N (Ens 1) = X, N Ea Enq gh? aag. (3.114) 
r=0 


This leaves the following generic one-loop tensor integrals to evaluate, 


dq git qt? ... gin 


Tiber = 
nr 
DoDi... Dai 


(3.115) 


As already discussed, in traditional tensor-reduction techniques this is achieved by 
reducing the tensor integrals to scalar ones. Technically this reduction results from 
analytically cancelling the momenta q in the numerator with the denominators D. 
Alternatively, the OPP method numerically expresses the numerator as a polynomial of 
the denominators, where the residual scalar integrals and their coefficients are obtained 
by solving systems of equations stemming from simultaneous multicut relations D; = 
D; =... = 0 for up to four different propagators at the same time. 

Setting po = 0 eliminates all momentum shift ambiguities and singles out Do. 
Cutting the propagator Do and removing the other loop denominators D; leaves the 
numerator term MÊ (Zn; q). Pictorially, it can be reconstructed through the recursion 
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(in) 
NE LAG = | z = 


or, written out, as 


NE (Tn; d) = NEG Sa 


fs (Tns in, Ip Aye (in). (3.116) 


Here, xe 5 and w® are the vertices and subtrees that enter the tree algorithm. The 
former are similar or identical to the vertices Vą used in the off-shell recursion rela- 
tions, cf. Eq. (3.14), while the latter are different. In contrast to the currents in the 
off-shell recursion relations that capture all possible combinations of given subsets of 
external particles, the wô are recursively constructed from Feynman (sub-)amplitudes. 


Pictorially, 


Xli, J, k) w (j) w (k) 


pP? -— m? +ie 


or 


w? (i) Z 


(3.117) 


with polarization vectors or spinors being the wave functions representating external 
vector bosons or fermions. Expanding” the vertex functions in orders of q”, 
xis = Vis +q” Zi (3.118) 


similar to Eq. (3.114), allows to decompose the recursion for the numerator terms in 
Eq. (3.116) to be written as 


Nauna dn) T [a ey E Oa’) + z, yë Nienna Tn-1) +] w (in). 
(3.119) 
The number of coefficents for this kind of decomposition grows polynomially, with a 
degree determined by the tensorial rank r of the original numerator term. Recycling 
subtrees, symmetrization over open-loop tensorial indices 41, H1, ---, Hr, “pinching” 


TĪn the equation below, only Feynman rules from gauge theory have been assumed, limiting the 
expansion to first order in the four-momentum; including also effective theories such as the Higgs 
Effective Theory with its more complicated vertex structure renders the inclusion of second-order 
terms necessary. 
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propagators, and thereby factoring out common factors Zn further enhances the ef- 
ficiency of this algorithm. Once the coefficients of the decomposition are known, the 
polynomial in Eq. (3.114) can be evaluated multiple times at very low CPU cost, 
thereby yielding a massive acceleration of the overall calculation. As a further benefit, 
the OPENLOOPS algorithm can be used in conjunction with both OPP and traditional 
tensor reduction methods or any admixture, which enhances it even more. As a con- 
sequence, after its initial implementation in the OPENLOOPS package with a first non- 
trivial application in [337], the algorithm has also been adopted by the MADGRAPH 
collaboration [148]. 


3.3.2 Dealing with infrared divergences: General ideas 


In Section 3.3.1, methods to calculate the virtual correction V in the expression for 
cross-sections at next-to-leading order, cf. Eq. (3.49), have been discussed. As already 
stated, this term exhibits two kinds of divergences: ultraviolet and infrared ones. While 
the former are confined inside the virtual correction and can be dealt with there 
through regularization and renormalization, the latter are a bit more tricky. This is 
due to the fact that these divergences, the infrared ones, must cancel between virtual 
and real contributions, due to the Bloch—Nordsieck (BN) and Kinoshita—Lee— 
Nauenberg (KLN) theorems [257, 678, 724]. This cancellation has already been 
observed in the previous chapter, for the example of inclusive W production at hadron 
colliders, cf. Section 2.2.5. Direct calculation of the virtual contribution in this example 
resulted in the terms exhibited in Eq. (2.117), while the direct evaluation of the real 
correction term, including an integration over the two-body final-state phase space 
resulted in Eq. (2.118). It is important to stress here that the exact cancellation of 
the infrared divergences in this case was only recovered upon integration over the 
respective phase space of the final-state particles. 

With more complicated processes, this way of directly calculating the matrix ele- 
ments and performing phase-space integrals to cancel the divergences very quickly ex- 
hausts its applicability. This can be traced back to the fact that, in general, final-state 
phase-space integrals for final states with more than three particles cannot analytically 
be evaluated, neither in four nor in the even more complicated case of D dimensions. 
For such cases numerical methods — Monte Carlo integration techniques — must be 
invoked, which by construction only work in an integer number of dimensions, another 
approach to deal with infrared divergences and their mutual cancellation is manda- 
tory. By now the method of choice is infrared subtraction. The underlying idea is to 
isolate the divergent structures through suitable terms such that Eq. (3.49) becomes 
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(3.120) 


where the real subtraction term Sn, which lives in the (n + 1)-particle phase space 
of the real correction, and the integrated subtraction term TS), which lives in the 
(n)-particle phase space of the Born and virtual contributions, cancel each other in 
the integrals 


i= f 8T Osi ue, ne) = [de Sa(nine.se). (3.121) 


The catch in this method is that due to the universal structure of infrared 
divergences in gauge theories it is possible to construct subtraction terms S in a 
process-independent way, guaranteeing that the difference (R — Subtraction) is finite 
in every single phase-space point in Pr. Even more, these terms can be constructed as 
the product of some Born-level configurations times some individual terms, accounting 
for the emission of an additional particle. There terms can be integrated over the 
phase space of the additional particle in D dimensions, yielding infrared divergences 
that manifest as terms proportional to 1/e? and 1/e. Before, however, turning to 
the explicit construction of these terms in a specific formalism, Catani-Seymour 
subtraction [353], the underlying idea will be elucidated in a toy model and applied 
to the by-now familiar example of W production at hadron colliders. 


3.3.2.1 Infrared divergences: A toy model 


In the toy model, assume that Born, virtual, and real emission matrix elements squared 
to O(a) read 


Ba = SIM, 


V, i 
Pa = A MME, (3.122) 


Raa) = O PD MM e)p, 


where both the Born and the virtual contributions are constant in the toy model and 
the latter has already been regularized in D dimensions and its ultraviolet divergences 
have been regularized and renormalized. The remaining infrared divergences then lead 
to the pole in 1/e, which has been made explicit in the virtual contribution — here, 
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in the toy model, V, indeed is infrared finite and ultraviolet renormalized. The real- 
emission part in the toy model depends on a one-dimensional phase-space parameter 
x € [0, 1] and it diverges for x — 0, producing a single pole 1/e, as again made explicit, 
while the function R,(x) is regular in the full interval. 

Written in these quantities the NLO cross-section is given by 


1 
NEO) — [Bn + Vn] rif dz Rn(x) Fr, (x) 


° (3.123) 


1 
= [e+] els f Eror), 
0 

where F7 is a (jet) criterion applied in order to ensure that the Born-level cross- 
section and the corresponding kinematics are free of singularities. The concept of 
jets has already been introduced in more detail in Section 2.1.6. Here, it suffices to 
remember that in the language of QCD the application of a jet criterion implies that 
the m outgoing particles are all sufficiently energetic and well separated from each 
other such that the Born cross-section is free of infrared singularities. In fact, this jet 
criterion could be replaced with any observable that ensures that the Born-level part 
is well behaved. 

Typically, when adding virtual corrections, no new phase-space regions become 
available because the incoming and outgoing particles are still the same as at the 
Born level. Therefore, the function F can be applied without much ado also to the 
virtual part. This is not true when adding additional real radiation. In fact, infrared 
divergences appear; in the toy model they are represented by the 1/x-poles. In order to 
deal with them the observable F7 must be defined in such a way that soft or collinear 
emissions — those with x — 0 in the toy model — do not affect it. This catches 
the essence of the intuitive definition of an observable being infrared-safe: it must be 
defined in such a way that further soft and/or collinear emissions do not affect it. 
Mathematically speaking this means that 


lim Fya() = Fs (0) = FY. (3.124) 
It is important to stress, though, that without infrared safe observables the full concept 
of higher-order calculations becomes entirely meaningless. 

The Bloch—Nordsieck and Kinoshita-Lee—Nauenberg theorems [257, 678, 724] now 
state that for infrared-safe quantities the infrared divergences in the real and virtual 
contributions cancel, which implies that 


lim Rn (2) = R,(0) = V. (3.125) 
In the framework of NLO calculations two systematically different methods have been 
developed to isolate the pole in the real-emission contributions, namely phase-space 
slicing [576] and subtraction [352, 353, 542]. In both cases the real-emission contribu- 
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tion will be written in D dimensions — in the toy model here this amounts to replacing 
the x! pole by x~!~*. Then the ideas underlying both methods are as follows: 


1. Phase-space slicing 
The idea [573] is to introduce an arbitrary cut-off 6 and to rewrite the NLO part 
of the cross-section as 


1 
Va pJ dx J 
oD = els | ROR @) 
0 
r ‘ A i 3 (3.126) 
n pJ T J T FI 
= z F; [rw 2) FS 4 (£ + | Rac n( Fai (T ). 
0 ô 
This can be approximated up to first order in € as 
V, f d f d 
n T x 
0 ô 
V, i d 
fi x 
= [1-6] ee qire in(2 #) Fi’ 1(#) + O (e) (3.127) 


a 


II 


dz 
logð Vay + | SER) PaE) +0 (0), 


6 


where 6~© = 1—e log 8+0 (e°) has been used. This procedure has a nice additional 
feature, that the answer should be independent of 6. However, there is a tension 
between retaining a good singular approximation (which is obtained by choosing 
a very small 6) and still avoiding corresponding large logarithmic cancellations, 
which leads to a preference of larger 6. An illustration of this check is shown in 
Fig. 3.6, taken from a calculation of Wbb production at NLO [520]. This ambiguity 
in choosing an optimal value of 6 has made this method in practice going somewhat 
out of fashion, since manual interference and careful monitoring of the numerical 
stability of the final results are mandatory. 


2. Subtraction methods 
This problem is alleviated by subtraction methods, where a zero is introduced 
into the result by adding and subtracting a term 


1 
dx 
0) F7 T aie (3.128) 


such that the first-order contribution reads 
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Fig. 3.6 An illustration of the dependence of a full NLO calculation on the 
phase-space slicing parameter. Reprinted with permission from Ref. [520]. 


1 
V, dx 
1) _ Yn pJ J 
ot) Fe + f ROR aw 
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V, ; d 
n pJ T 
z En + Rr (0) F; [= 
0 
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xv T 
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0 0 
Vn p i d 
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This subtraction is possible because the infrared structure of the toy model is 
entirely known and fixed by Eq. (3.125). In real applications the case is not quite 
as trivial, although the infrared structure still is fixed by the fact that in the soft 
and collinear limits of parton emission the respective real-emission contributions 
factorize into a process-dependent Born part and a process-independent parton 
splitting part, where the spin dependence is given by the Altarelli—Parisi splitting 
kernels already encountered for the case of QED in Eq. (2.9). 
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3.3.2.2 Toy model vs. cross-section in the subtraction formalism 


Translating the toy model of the previous section to the language employed in the 
rest of the book, in general a cross-section at leading and at next-to-leading order is 
written as 


g(hO) = [avs By, (Ps; ur, UR) 
g(NLO) — fass 8, (Bas nrun) + Vn(®p; wr, uR) + T°) (®g; wr, un) 


+ far [Ra Orin un) — So(Br pr) ; 
(3.130) 


cf. Eq. (3.120). As before, ®g and ®z denote the Born and real-emission phase space, 
respectively, and Bn, Vn, Te) Rn, and Sn, are the matrix elements for the Born, 
renormalized virtual, integrated subtraction, real-emission, and real subtraction con- 
tributions. 

The integrated subtraction and the real subtraction contributions actually corre- 
spond to the terms 


1 
dx 
ae J 
+Rm (0) FA f ee (3.131) 
0 


which are combined with the virtual correction V,,F7 /e and with the real correction 
term in the toy model. By now, there are different, well-established methods to con- 
struct such subtraction terms in a process-independent way. Note that for hadronic 
initial states V has been defined such as to include the collinear mass-factorization 
counter-terms related to divergences that are absorbed into the definition of the PDFs 
for the incoming partons, essentially the terms in the last line of Eq. (2.118). All 
integrands include parton luminosity, symmetry, and flux factors. 


3.3.2.3 Example: W production at NLO with simple subtraction 


To see how the subtraction method works in more detail, it is instructive to consider 
the real radiation contributions that enter the NLO corrections to W production. 

The starting point is the real matrix element squared for the process ud > gW+ 
given in Eq. (2.101). As already observed, it is divergent in the limits tf — 0 and û — 0. 
The key to a general approach for handling these infrared singularities lies in the ability 
to deal with the collinear singularities associated with each of these limits separately. 
This can immediately be seen by analysing the kinematic dependence, isolating the 
divergent terms and then using partial fractioning, 


^ A 2 
2 a- 2° 4 t ^ 2 2 â 
tf +u + 2miys P ( +ô) = My § 9 (3.132) 


ta, ta 
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In the last line, overall momentum conservation has been employed, embodied by the 
identity ê + É + û = mj,. Furthermore, the terms in square brackets in the final line 
can be written in terms of the single dimensionless quantity 


a= my /8, (3.133) 
leading to 
LO 2 2nCras("UR) LO 2 [E +A +2m2,8 
eae a) ad Mo ars i aa a (3.134) 
Ww a 
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for the amplitude squared of the real-emission contribution. Isolating the divergent 
terms x 1/t or 1/å, the equation above can be cast into the form 


Piet oe (LO) 
_ r ud>Wwt 


2 [Dee 2) + D(a, x) + Ra] l (3.135) 


The infrared-singular terms are expressed in terms of the function D(t, x), where 


1 2 
D(t, x) = 8nasCr | = ( 1 °)| ; (3.136) 
t \l-«@ 
and the non-singular remainder reads 
22 
R(x) = 81a,Cr a 3 (3.137) 
Mw 


The sum of the two singular terms therefore forms the subtraction term, 


ud=>W+ 


1 
S(r) = = |M 


k [pé 2) + D, 2) (3.138) 


It is worth noting that they are identical to the dipole subtraction terms used in 
the seminal Catani-Seymour paper [353], but they have been simply derived from the 
original singular matrix element. The general form of the subtraction terms in Catani— 
Seymour subtraction will be discussed in the next section. This dipole term is also 
closely related to the standard Altarelli—Parisi splitting function for a quark into a 
quark and a gluon. There is one dipole for each of the ¢ and û singularities, but due to 
the simplicity of the process and its symmetry with respect to the initial states their 
forms are exactly identical. 

To compute the corresponding integrated subtraction term, Z(®) in Eq. (3.130), 
first the phase space must be appropriately factorized to allow an analytic integration 
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over the one-particle phase space of the emitted gluon. The phase space is given 
by 


d?”pw 2 2 day 2 DsD 
dwy = p (27) 6(pw — My) Bp (277) 4(p5) (27) O° (Da + P — PW — Pq) 
(27) (27) 
= (27)? -P dP "Ps 6 ((pa 4 )? — my) (3.139) 
= 2E Pa T Pb — Pg Ww ‘ 


where, in the last line, E represents the energy of the gluon. It is straightforward to 
evaluate this in the c.m. frame. The result reads 


(Qa)? ey ( at) ee re eee eee 
ddyw,=-—— {=} d[- dé dQ'~**6 (5+ f+ û-— , (8.140 
Wg 2/5 3 WE (8 u miy) ( ) 


where dQ?~ is the solid angle element in D — 3 dimensions, 


27 
1—2e _ 


It is convenient to use the two dimensionless variables defined by 


g+t+a 7 t 
pete. tü _ mw us ee 
S 


~ me ; (3.142) 
where the former has been simplified using the Mandelstam relation § + ê+ @ = m3, 
from Eq. (2.100). The phase-space integral can be expressed through them as 


gl-e dq!-2< 


gO ae (27)! 22 


da dvv~* (l—a—v) [2r ô (x8 — miy)]. (3.143) 
The factors in square brackets can now be recognized as the final-state phase space for 
W production, at the reduced c.o.m. energy squared, xs. The phase space is thus an 
integral over x of the convolution of this reduced phase space with the dipole phase 
space defined by 


r gl-e dQi-2. 2 
dé(a,v, 8) = 16x? (Qn) dvv *(l-x-v)~, (3.144) 


where the integral over v ranges from 0 to (1— x). In this case, since the only partonic 
content in the leading order process is in the initial state, this corresponds to an 
initial emitter and initial spectator, anticipating the language of the Catani-Seymour 
approach that is used in the next section. 

The dipole phase space derived above can be used to integrate the single dipole 
term of Eq. (3.136). Restoring the correct overall dimensions with a factor of 17°, 


u” f Pia) dé(a, v, 8) 
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Here, the convenient constant cr of Eq. (3.62) has been used. The v integration can 
be evaluated by substituting the variable v — (1 — x)y and then using the result 
in Eq. (A.6): 


[wera ‘ ves = oe (1—2)-*. (3.146) 


The result is a function of x that is well defined as € —> 0, except at the point x = 1 
where a non-zero value of € regulates the divergence. A convenient way of extracting 
this singularity is to employ the same + distributions that were already used for the 
splitting functions, cf. Appendix A.1.2. Applied to the case at hand, this allows the 
following replacement to be made, 


zE(1— x)? silg) 1 ern 
E 7 272 
_ ae 2e log(1 2] g (1 AEE ) ea x) 
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where terms of order £? or higher have been dropped. The remaining terms are non- 
singular in the limit that x — 1 and so do not require the introduction of further 
+-distributions. Thus the integrand becomes 
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In this equation the regularized splitting function, to first order in ag, PR (2), 
appears. It has been given in Eq. (2.33). 
Using the relation between gamma functions given in Eq. (A.5) ultimately yields 
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(3.149) 


There are two such contributions that take identical forms originating from the D(f, x) 
and D(û, x) terms in Eq. (3.135). The sum of the two is the form of the integrated 
dipole term represented by Z‘) in Eq. (3.130). Note that, since the starting expres- 
sion in Eq. (3.136) was defined in four dimensions, this calculation has implicitly been 
performed in the dimensional reduction scheme. Working in conventional dimensional 
regularization would have introduced an extra term —e(1 — x) in that equation that 
would eventually manifest itself as an additional +(1 — x) inside the square brackets 
in Eq. (3.149) 

With the explicit expression for this contribution at hand it is possible to see 
exactly how the factorization of singularities into the parton distribution functions 
occurs. The relevant singular contribution in Eq. (3.149) is given by 


2 E 
Os ( HF low 
a (£) af -Poa (x) >, (3.150) 


since the remaining singularities proportional to d(1 — x) cancel with those obtained 
from the virtual diagrams. The NLO quark PDF fy/n(y) can then be defined in terms 


of the bare PDF Ea (y), which does not exhibit any scaling violations, as follows 


(a4 l L 
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in order to absorb this singularity. The factor of 1/x present in this equation can be 
traced back to Eq. (3.135) while the dependence on te (y/x) is a consequence of the 
fact that xê = mj). In addition to the pure 1/e singularity, this definition also includes 
constant terms that are obtained by expanding out the universal factor cr. 

Note that it is only the pole term in Eq. (3.151) that must be absorbed in order 
to arrive at a finite result while the constant terms are a matter of choice. The choice 
of additional, finite constant terms that are absorbed defines the factorization scheme 
that is used for the PDFs. The definition in Eq. (3.151) corresponds to the modified 
minimal subtraction, or the MS-scheme, which is the preferred definition for all 
modern PDF sets. This scheme dependence must also account for the particular reg- 
ularization scheme used in the calculation. The redefinition in Eq. (3.151) corresponds 
to conventional dimensional regularization. In the dimensional reduction scheme one 
must make the replacement, 
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Hence, after this factorization has been accounted for, the contribution of a single 
dipole term in Eq. (3.149) can be written as, 
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In this equation the regularization scheme dependence has been captured by the con- 
stant a©PR, previously introduced in Section 3.3.1. Notice that, as must be the case, 
once both dipole contributions are accounted for the dependence on a©P® in this 
equation exactly cancels that from the virtual corrections, given in Eq. (3.64). 

In fact this result is also very close to the one for the full real corrections previously 
quoted in Eq. (2.118). The two differ by a single term which is given by the integral 
of the non-singular remainder in Eq. (3.137), 


ue / Ria) dole.0,3) = aCe ( = ) eI X1 —2)]. (3.154) 


3.3.3 Catani-Seymour dipole subtraction 
3.3.3.1 General idea 


In general, subtraction methods like the one above, based on the form of the actual 
real-emission matrix elements, are not very useful when considering more complicated 
topologies or even the automation of the subtraction methods in form of computer 
code. In such cases, process-independent methods such as the FKS method [539, 542] 
certainly are advantageous. 

Maybe the most frequently used of such algorithms is known as Catani-Seymour 
dipole subtraction (344, 353]. The idea underlying this method is to treat the emis- 
sion of additional particles in phase space in such a way that the soft and/or collinear 
limits of the individual emission factorizes from underlying Born-type configurations. 
This is possible, in a process-independent way, by realizing that in the soft limit the 
emission cross-section of an additional particle — typically a gluon or photon — can 
be written as the product of an eikonal, cf. Eq. (2.14), and a corresponding Born level 
cross-section. Similarly, in the collinear limit, emissions can be written as the product 
of a Born-level cross-section with a splitting kernel of the form introduced in Eq. (2.33). 
In Catani-Seymour subtraction these two limits, soft and collinear, of the emission are 
analysed by introducing a spectator parton k. This spectator parton also allows the 
construction of phase-space mappings i + j +k — {ij} + k for the combination of 
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partons 7 and j into an emitter {ij} with momentum ,;, while parton k accounts for 
the recoil and changes its momentum from pk to pz. This not only allows all particles 
to be kept on their mass-shell all the time, pz = p} = mz and pij = m;,, but it also 
directly leads to a factorization not only of the matrix elements, but of phase space. 
The latter fact is important to allow a calculation of Born-level matrix elements with 
one less particle. The catch in this construction now is that these extra emission bits, 
{ij} +k — i+j +k can be integrated analytically in D dimensions, thereby allowing 
them to be subtracted in differential form from the real-emission matrix element and 
added back to the virtual contribution in integrated form. 

In order to put both types of contributions together, it is sensible to decompose 
the eikonal cross-section term W(p1, p2; k) with its symmetry between both emitting 
partons, with light-like momenta pı and pg, into two individual terms with a definite 


emitter and spectator assignment. Including colour factors, they schematically read 


Tı- Tə (4 D> J pıp2 
W(p1, p2; k) = Sh ep ee 
(Pi, Pai k) 2 pık pok 12 (ork) (pok) 
P1p2 P1p2 
=T,- T i (3.155) 
g poe +p2k)  (p2k)(pık + p2k) 
= Dik:2 T Dok, 


where the two dipole terms Dir; have been introduced, with 7 denoting the emitter 
or splitter, k being the emitted particle, and 7 denoting the spectator. Note that the 
colour generators T; j related to the particles 7 and j act as matrices on the full colour 
structure of the Born-level cross-section the eikonal is being attached to in order to 
facilitate the extra emission. 

Analysing these terms shows that each of them diverges in both the soft limit of 
the energy of the emitted parton approaching zero, wkp — 0 and the collinear limit 
of its momentum becoming parallel to the momentum of the splitter, k” || pi’. At 
the same time, the individual terms do not diverge for the emitted momentum k 
going parallel to the spectator momentum. This helps to disentangle soft and collinear 
divergences. It also helps to analyse singluarities related to the eikonal — essentially 
the soft and soft-collinear ones — in colour space, including leading and sub-leading 
colour contributions and to add the hard collinear divergences, which are encoded in 
splitting functions and are a leading colour effect. This means that the eikonal-based 
dipoles above have been generalized in such a way that they also contain the collinear 
bits encoded in suitable splitting kernels. Full dipole subtraction terms D therefore are 
introduced that emerge from the combination of Born-level matrix elements squared, 
including all symmetry and PDF factors, with terms similar to the D from Eq. (3.155). 
The catch here is that this factorization of the full subtraction matrix element into 
Born-level parts and individual emission terms also includes a factorization of the 
phase space such that for these terms 


Bp = g Q Bı. (3.156) 
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Considering in the following a 2 > n-—process at Born-level with momenta papy —> 
P1p2...Pn, the subtraction procedure uses two terms: a differential term, which can be 
written in the form of a sum of differential dipole cross-sections with an additionally 
emitted parton, D(pa, Pb; P1, P2, ---; Pn+1) to be subtracted from the real-emission 
contribution, 


S(®p) = 5 D(Pa, Pb; P1, P2, reep Pnti) 
dipoles 


= J Bijs(®B) ® Dijr(®1) — B(®8) 9 D(41). 
ij,k 


(3.157) 


Here, to simplify the notation, the sum over dipole terms Di;. 4(®1) has been replaced 
with the product of a dipole operator, D(®1), with the summation implicit. 

This structure is reflected by a sum of the corresponding integrated terms, to be 
added back to the virtual contribution, 


I) (Gz, e€) = 5 TOP) (pa, Po; P1, P2, -+ oy Pi—1, Pi+1, 25 Pn+1) 
dipoles Z (3.158) 
= Ñ Bir(®g) © TGL) — B(®8) 9 D(z). 
ij,k 


Here, one final-state particle has been integrated out. This stresses that there are only 
n particles in the final state of this integrated terms, which therefore have Born-level 
kinematics. 

The following discussion will proceed closely along the lines of the seminal paper 
by Catani and Seymour (CS) [353]; a closer relation with their paper will be worked 
out in Appendix C.2, where individual equations in CS will be directly linked to the 
expressions below. 


3.3.3.2 Catani-Seymour dipole subtraction for final-state partons only 


For example, for the case of a dipole with both splitter ij and spectator k being final- 
state particles, these differential dipole subtraction terms, denoted as Dj;.~, schemat- 
ically read 


Dij;k (Day Po; P1, P2, +++) Pn) 
= B(pa, Pb; P1, P2, +++, fij; re) Dk, Sos Pn) & Dij:k (Pi, Pj, Pk), (3.159) 


where the Dijk are the actual dipoles. 

The momenta pij and Pk in the Born term emerge from the combination of the 
momenta p; (the splitter), p; (the emitted parton), and p;, (the spectator). In the case 
considered here, massless final-state particles, 
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E Yij,k 
Pij = Pi + Dj — EL 
— Įij,k 
i (3.160) 
Bk = = Pk, 
1 — Yij,k 


so that the spectator parton keeps its direction but its momentum is stretched by a 
factor 1/(1 — Yij,k), where the dimensionless quantity y;;,, is given by 


PiPj 


Yijk = ; 3.161 
(oS O pipj + Pjpk + pkpi Gi 
The splitting functions depend on both yij,k and the splitting parameter 2;, 
z PiPk DiPk is : 
Zn = — and Zz; = 1- %. 3.162 
"(pi +D;)Pe Big De > i aee 
With these parameters, the typical form of the divergences can be written as 
PiPj + Pjpk +PkPi _ 1 (3.163) 
(pi + pj)pr 1 — %(1 — yij,k) 
In the collinear limit, p; || pj, or, more formally, 
k? n” k? n” 
B o= ype ppi BL ond pf = (1 oH Ah + 3.164 
Pp a ne (1—2z)p Los ( ) 


where p” denotes the light-like collinear direction, and the transverse momentum k_ 
is perpendicular to it and to the auxiliary light-like vector n”. Here, z is a splitting 
parameter, which can readily identified with z; from above. Then, the collinear limit 
is given by 

ki 


—-——=— with k, >0. (3.165) 
z(1— 2) 


2piPj = 
In this limit, 
ki 
ij > = SS 3.166 
ae 22;(1 — %) bij De ( ) 


and, of course, Pk —> pk and Pij + pi+pj = p. Similarly, in the soft limit, yije —> 0, 
2 > land, again, pk + pk, while pj; —> pi- 

For all three particles, splitter, emitted parton, and spectator in the final state, the 
dipole reads 


/ 
Bis = -ap MMi (3.167) 
The colour factors T must be inserted at the right place into the Born matrix element 
and in addition the spin or polarization states labelled by |s) must be accounted 
for in the product of Born-level matrix element and dipole. The actual information 
concerning the flavour of the partons and their impact on the kinematics is encoded 
in the kernel Vij k, which for the case of a quark emitting a gluon is given by 
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2 
l= 2:0 gee) 


(s|Vo.g;:k15 ) ae Sru CrasluR) | (1 + Zi) = e(1 = ži) ss. 
(3.168) 
The corresponding integrated dipole term Z(P) (e) is given by the integral of the 


dipole over the emission phase space of parton j, namely 


2 2 E 
(D) as (LR) Anup Ti;- Tx 
TË = (£), .169 


where the V;j(€) can be constructed from the following expressions: 


2f 1 T? 1 
Val) STi |z gz) tut eti) +e 


z2 
2 
cr (5-7 fori = q 
K = 2 6 
PE: 67 r 10 
Set oe RE Suerte. po es (3.170) 
a(g =) g RNs fori =g 
$0 fori = q 
ce ee 


2 
6 Ca — grin fori = g. 


The anomalous dimensions ae have first been encountered as the terms proportional 
to 6(1 — z) in the construction of the DGLAP splitting kernels, Eq. (2.33). The term 
K, actually will return in another context, namely in resummation, where it will be 
denoted as K and relate to a generic, flavour-independent higher-order correction to 
the emission of a soft gluon, cf. Eq. (5.65) in Section 5.2.1. These terms effectively also 
parameterize the contribution of collinear logarithms in Q, resummation, see also 
Eq. (5.66). 

As before, for the differential dipole terms, an integrated dipole operator I(®g; £) 
is being constructed such that, again, 


IS) (Og) = X Bijr(®B) © TGE) — B(g) 8 I(®s; £). (3.171) 
ij;k 


3.3.3.3 Example: Catani-Seymour dipole subtraction for e~ et —> qq 


To see how this works in practice, consider the case of e~e* — qq at next-to-leading 
order, closely following the example in Appendix D of [353]. To keep things simple, 
the exchange of a Z boson will be ignored and only the case of a virtual photon in the 
s channel will be considered. In this case, the dimensionless parameters y;; ;, reduce 
to the scaled invariant mass of the pair, 
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2PiPj 
Yij,k = Q2 ; (3.172) 
where Q is the centre-of-mass energy of the hadronic system, ê = Q?. With eq the 
partial charge of the quark, and the Born phase space ®g being a function of the quark 
and anti-quark momenta pı and p2, § = (pı +p2), the leading-order cross-section can 
be written as 


(LO) _ n 4ra? e? 

o = fars B(®g) = EE (3.173) 
Note that in comparison to the original work in [353], the jet function F ({p;}) 

has been ignored by setting it to unity; this corresponds to a calculation of the total 

cross-section. Other choices would translate into cuts on the n—particle final state: two 

for contributions with Born-level kinematics and three for real correction terms. This 

implies that in the limit of soft or collinear emissions PEHD must reduce to FY, 


n41) soft, colli F 
PA ee (3.174) 


This is a fairly trivial manifestation of the requirement of infrared safety for cuts on 
kinematic configurations in physical processes. 


The three-parton matrix element for ggg production of course corresponds to the 
real correction at NLO to the process and it is given by 


_ 8aC pas(UR) r? + x3 
R(pı, P2, p3) = g2 ETT B(®z) (3.175) 


with x; = 2p;Q/Q?. The respective real-emission phase-space element, conventionally 
expressed through the fractions x; reads 


2 
TE 2 A T ERE EE a ER (3.176) 
The dipole subtraction terms are easily constructed. The splitting function is of 
course the qg splitting function from Eq. (3.168). Inserting it into the Born-level matrix 
element is fairly straightforward due to the Kronecker ô in the spins, and the only 
somewhat tricky bit is the colour factor coming with it. Taking a closer look, the 
relevant term actually reads 


S Vn 3.177 
2PiPpj Ti i ( 


where the T;; and Tẹ are the colour matrices related to the splitter and spectator 
that are inserted into the matrix element. In the dipole term here, T;; = Ty = T13 
and T} = Tg = To. By virtue of the fact that the overall amplitude must be a colour 
singlet, for each splitter ij the sum over all spectators k of the term T, will result in 
—T;;. In the case here, this implies that 
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1 ret 1 
is ah (3.178) 


® = 7 
2pipj TŽ 2pipj 


The dipole term relating to the gluon p3 being emitted from quark pı therefore is 


given by 


_ ssc, ou 1 =0) Tis: T2 
D13;2(p1, P2, p3)°—) = B(pis, P2) ® |- 2p1p3 i t T? 
13 


8rCrasluR) | 2 
2p1p3 1— (1 — y13,2) 


= B(p13, P2) Q 


H (3.180) 


where the quantities y13,2 and 2, in the general expression for the mapping in Eq. (3.160) 
can be expressed through the x; in the specific case here as 


Ñ= 
Y13,2 = 1- T2 and Ži = 3 . (3.181) 
T2 
Inserting this and using that x; = 2 — £j — £k, 
D13;2(p1, p2, p3) = 
-L 8rCrasluR) 2 1 — z1 — T2 
ire . 1 | 
B(pisz, p2) (1 m r2)Q? P T1 — T9 T2 
w a 8rCraslurR) 1 2 1-2 
= 6 . 1 ‘ 
(P13, P2) Q? foe (2m1 — Tı T 
(3.182) 


and a similar term for Də3,ı with the replacement 1 + 2. Together, these two terms 


constitute the subtraction term, 


S(r) = S(pı, p2, ps) = D13;2(p1, p2, pa) + Dosa (pr, p2, ps)". (3.183) 


The subtracted real-emission term therefore reads 
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on (1 — #1)(1 — 22) 
i 7 1 2 1-2 
— dg Bis, Ba) - Fee (= k n) | T2 7 
4 B 1 2 1-2 
—d®x B(fo3, P1) ` E (H 1 va) T1 l 
(3.184) 


Casting the three-parton matrix element into a form similar to the subtraction terms 


(1—zı)(1— z2) 1-22 


2 2 1 2 
A (5 1 v1) boy Sy E 


— Tı — T2 


makes the cancellation of divergences explicit. The subtracted real-emission contribu- 
tion therefore is given by 


Cra, 
do®-9 = — reskin) dbz B. (3.186) 
T 


The subtracted one-loop correction consists of the genuine virtual contribution and 
the integrated subtraction term, in this case 


do +D = dög [Vies) + I) (Gg: °)| 


= dbz Vie) + B(s) © I(®g; e)| ; (3.187) 
The individual integrated dipole terms Te) and its barred counterpart, which consti- 


tute I(®g; £), are given by 
€ 
(D) Ty:Tz as(ur) (ATUR 
Toa-q(® = VY 
agg 2B; €) T? Qn (1—e) z ag(€) 


Ss 
_ Cras(ur) (Aruh “Tq ae ; 1? 
~ T(1—e) 8 : 


(3.188) 


e2 | Qe | 2 


Again the fact has been used that the overall amplitude must be a colour singlet, 
implying that, as before, the colour matrix Tg = —T, and therefore Ta: Tg = —CF. 
Together these yield the overall result for the subtracted virtual contribution to the 
cross-section, namely 
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ol = don Blon) | Seale (TA) (2 2 84074000) 
Cras(ur) (4tuR\° ( 2 3 2 
serein) (Ste) ( Z+2+10-7+0@)]| 


= dbz B(®z) a 
(3.189) 


As anticipated, the overall result for the subtracted virtual contribution also is finite, 
rendering both individual parts of the next-to-leading order correction separately finite 
and therefore allowing their safe numerical integration. 
However, in combination with the real-emission part above this yields the well- 
known result 
NEO) 00 ( 


ete->qq 


(3.190) 


3.3.3.4 Master equations for Catani-Seymour dipole subtraction 


As a consequence of this decomposition into splitters and spectators, there are four 
types of dipole structures in Catani-Seymour subtraction, namely all combinations of 
initial and final splitters with initial and final spectators. In the original CS paper, 
initial state particles are indicated by moving their label from Eq. (3.155) from sub- 
script — reserved for final state particles — to superscript. This is also exhibited in 
Fig. 3.7. 

The full differential subtraction cross-section, including emissions from splitter— 
spectator pairs in all combinations of initial and final state particles, written in the 
notation of this book, is given by 


da‘) (Pa, Pb; P1, P2, +--+ Pn+1) 
= dër > Dij;k(Pa, Po; P1, P2, se Pari) +Y Dij (Pay Po; Pi; P2, sey Dn+1) 
y {ig} {ij} 
kÆi,j à 
+ ` D (Pa, Po; Pi, P2, +++) Pnti) + 5 DY? (Da, Pb; P1, P2, +++, Pn+1) 
{aj} {aj} 
kAj b4a 


(3.191) 


In this equation, the first line in the square bracket refers to emitters in the final 
state with the first term referring to a spectator in the final state, whereas the second 
term relates to a spectator in the initial state. The pattern repeats itself in the last 
line, with the only difference being the splitters in the initial state. Of course, in all 
cases kinematic maps similar to the one in Eq. (3.160) will be invoked to construct 
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Visi 


Pk k b A 


Fig. 3.7 Pictorial representation of the different Catani-Seymour dipole 
types: final-state splitter with final-state spectator (FF) (top left) and 
with initial-state spectator (FI) (top right) and initial-state splitter with 
final-state spectator (IF) (bottom left) and with initial-state spectator (FI) 
(bottom right). The blob denotes the m-parton matrix element, incoming 
lines enter from left, and outgoing lines leave the blob to the right. 


the corresponding Born-level kinematics. For the details of this construction in the 
various cases, cf. Appendix C.2 

Turning to the actual construction of the subtraction terms, the terms relating to 
final-state emitters in the first line have very similar structures, namely 


1 T;;- Tk 
Vij,k — 
2pipy ” TŽ 
(3.192) 
for the final-state splitter final-state spectator (FF) case, cf. Eq. (3.179). For the FI 
case, 


Dijik(Pa, Po; (Pay) = B(pa, Po; ens Pijs +445 Dey ---}) Q 


1 1 T; Ta 
2D;P; Tija ’ TŽ ? 
(3.193) 
indicating that the momentum of the spectator parton a in the initial state will be 
altered. In both cases, FF and FI, only the relevant momenta have been shown as ar- 
guments of the Born-part B. It is also understood that due to the implicit cancellation 
of IR divergences in the difference of real-emission and subtraction terms, the dipoles 
V ultimately will be evaluated in D = 4 dimensions, with € = 0. The kinematic 
maps for the two cases, FF and FI case, including in particular the extra term Tija 


Di; (Pay Po; {pi}) = B(Ba; po; {---> Bij, ---}) @ 
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in the propagator-like structure before the dipole, is detailed in Appendices C.1.1 and 
C.1.2. It is worth noting that, due to the initial-state parton compensating for the 
recoil of the emission process in the FI case, the Born-level matrix element, including 
the PDFs must be evaluated with different initial state kinematics, as indicated by 
Paj in its arguments, and of course also flux and symmetry factors must be adjusted 
correspondingly. This, however, is a fairly trivial manipulation. 

This simple picture somewhat changes when initial-state singularities emerge due 
to initial-state particle being the emitters. These cases are treated by the subtraction 
terms in the last line of Eq. (3.191). The reason for this added complication is that, 
first of all, the factorization of the phase space in the soft and especially in the collinear 
limit must be checked, which has been done in great detail and instructively in the 
original CS paper [353]. In addition, now the altered initial-state parton momenta 
entering the Born term translate not only into the need to evaluate the PDF for 
this parton at a different x, with a potentially changed flavour, as above, but the 
emission off the initial-state parton also introduced new divergences which must be 
absorbed into the definition of the PDF. Essentially, this is because the particles in 
the initial state are fixed to move along a preferred direction, the beam axis. This 
additional constraint, which is not present for final-state emitters, necessitates special 
treatment and the emergence of additional terms. Different ways of how this is being 
done and which finite terms are absorbed together with a collinear divergence give rise 
to different factorization schemes, all of which of course are variants of the collinear 
factorization. This implies that scheme-dependent terms may arise, which must be 
properly accounted for. With this in mind, 


1 I 
2PaPj Tjk,a £ Ti, 


DY (pa, Pb; {pit) z B(Baj, Pb; {. <; Pks aay) & 


for the IF case, and 


1 1 yob Taj: Te 
2PaPj Tj ab T2; 
(3.195) 
for the II case. Details of the phase space mappings and the appropriate splitting 
functions V% and V? can be found in Appendices C.1.3 and C.1.4, respectively. It 
is worth stressing here that by far and large the kinematic mappings are organized 
such that spectator partons keep their direction. This of course is not feasible any more 
when the splitter parton is in the initial state: in the case of IF splittings then the 
final state spectator must compensate the transverse momentum transfer; in the case 
of II splittings this is of course not possible. In this case, the transverse momentum 
compensation is achieved by moving the complete final state. 

Turning to the integrated subtraction terms, to be added back and combined with 
the virtual parts of the NLO calculation, one has to remember that they are essentially 
given by the integral over the real-emission phase space of the differential ones. In 
principle, this is fairly straightforward and results in 


DY (Da, Pb; {pi}) z B(Paj, Do; {3 Leg Pk, + Gee }) 8 
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do = d&g B(®g) ® I(®g; £), (3.196) 


where the integrated dipole term is constructed from terms with final- or initial-state 
splitters and spectators in intuitive notation as 


I(®z; £) Z Irr (®g; £) + Ip;(®g; £) + Izr (Ëg; £) + Izz(®g; £). (3.197) 


The integrated dipole for the final-state—final-state case is given, as before, by 


Anyi © Tra + Ty 
“ard Jh 2 ( a a —~ Visle) > (3-198) 


2pri 
) Cay eto | PPUNPR 


Irr(®g; €) = 


where the sum is over all pairs of particles {ij} and k. Note that here the “splitter” 
parton has been denoted by {ij} in order to make the connection to the splitter- 
spectator notation in the differential dipole terms more manifest. The terms Vg; (€) 
for different flavour are given in Eq. (3.170). Naively, of course, terms with an identical 
form, apart from trivial replacements in momenta and colours, also emerge for the 
other integrated dipoles. 

This, however, is a bit too simple to be entirely correct. In cases with initial state 
partons a complication emerges, which originates from the fact that the kinematics 
mappings imply a change of the incoming particle’s momentum, irrespective of whether 
it is a splitter or spectator parton. Schematically, for the Born-level n-particle phase 
space and the matrix element this will be accounted for by 


dög TEEN de ddg(E) 


+ (3.199) 
B(@g) TEE Bios, £). 


For the integrated dipole this translates into an additional integration over the re- 
coil parameter £, in the interval [0, 1], which will ultimately lead to the emergence 
of various functions of € to be folded into the resulting expression. In principle, € 
parameterizes how much the momentum of the incident parton acting as splitter or 
spectator changes, and it will therefore also change the four-momentum balance ac- 
cordingly, encoded in a 6 function in the phase space element d®g. In addition, if the 
parton in question actually is treated as the splitter then this change in momentum 
will also impact on the argument of the PDF, replacing the corresponding x with «/€. 
As already indicated in Eq. (3.149) in Section 3.3.2, there are also additional terms 
that must be considered when emissions off initial state partons are present. These 
terms are related to further collinear divergences originating from the fact that the 
matrix elements must be convoluted with PDFs, which themselves exhibit divergent 
structures in their scale evolution. A fixed-order part of this evolution actually emerges 
when considering initial-state singularities alone in the matrix elements and therefore 
must be treated through a suitable collinear subtraction. It is no surprise that these 
terms conversely exhibit a dependence on the details of the factorization scheme, man- 
ifesting themselves in some finite terms. In the MS scheme used throughout this book 
these terms reduce to zero, but in other schemes such as the DIS scheme this is not 
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the case anymore. 

These collinear divergent terms lead to single poles already encountered in the 
previous section, cf. Eq. (3.150), and are of the form p7%/eP;;(z). For every initial- 
state parton a, this collinear subtraction term reads 


1 
mp2, \~ , 
dol) = > f agaoa(e Bs £) |- (EE) PRO + KEO. 6200) 
a’ 0 


Here, po ) (£) are the Altarelli—Parisi splitting kernels to first order; their four-dimen- 


a 
sional version is given in cf. Eq. (2.33). The finite terms K ae’, (6) are related to the 
choice of factorization scheme. As already indicated, for the MS scheme, they are given 
by 
Kes (6) = 0. (3.201) 
Therefore, after absorbing the pole in the PDF, residual terms are left stemming from 
their expansion in €. 

The form of this subtraction term in fact could have been anticipated without 
any calculation, they merely connect the PDFs from their scale wr with the parton 
density at the scale u where the rest of the process and in particular the singularities 
are evaluated. Not surprisingly the implicit change of parton density must be accounted 
for, in a form similar to the DGLAP equation, which is why the respective splitting 
kernels emerge. If the process has one initial-state parton only, merely one of these 
terms has to be considered; in the presence of two initial-state partons, two of these 
terms with suitable choices of flavours, etc. , have to be added to the virtual part. 

In addition, and as already indicated, there are also terms coming from the initial- 
state parton in the integral over the emission phase space in the dipole terms, like 
the ones in the second line of the integrated dipole in Eq. (3.149). It has become 
customary to construct the integrated dipole terms for initial-state partons through 
specific collinear subtraction terms, extending the one in Eq. (3.200). These terms are 
added to the simple dipole functions constituting the integrated subtraction terms,® 


do%+°) = d®gBas(pa, po) Q Ie) 


1 
+ > déa d®g (Ea) Bars (EaPa, Po) Q [Ke (fa) + po (Pa; Ea; n) 
a’ 9 


1 
+ 2! dés dg (£v) Baw (Pa, Eepo) Q [K (&) + PH (Spo, £; 1)| 
v o 


(3.202) 


8It is worth noting that similar reasoning in fact also applies for processes involving specified 
hadrons in the final state and the corresponding fragmentation functions: also in this case collinear 
subtraction terms must be added which stem from the evolution of the fragmentation functions 
through secondary emissions. 
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where the dependence of the Born cross-sections on the incoming flavours and momenta 
has been made explicit. The integrated dipole terms are given by Eq. (3.198) and 
similar. They can be combined to yield 


I(e) = I (pa, Po; Pi, ---, Pm; €) 
e ar eee) a 
al T E A PUA let ?PiPe 
oree o A 
2 c av 2 Cc 
cE {a,b} T i 2PpcPpi 2pcpa d#c 
(3.203) 


The terms in the first square bracket relate to dipoles with final-state emitters i — 
the first sum is over final-state spectators k and the second sum is over the two initial 
state spectators c — while the second square bracket relates to dipoles with initial- 
state emitters c, where, again, the first sum is over final-state spectators 7 and the 
second term is for the other initial-state particle d # c being the spectator. In all 
cases, the V are given by Eq. (3.170). 

The operators in the collinear terms, for instance Ko (€,) and Pa’ (EaPa, Ea; tee) 
emerge from the considerations above and are given by 


F 1 T ij -Tw 
(Ea) +822 A {ij} 


aa 


K” (£4) = as (HR) K 


2 
an {ij} Tij 
Ty Ta Sea! z 
~ y K (fa) — Kets (Ea) (3.204) 
T2, 
and 
PY (erpa a iy) 
as( UR) p(1) Lay Ea ua TiTa u2 
=s gat Se l } l A 
a Peale) De Sapna T 8 apan 
ij 
(3.205) 


cf. Eq. (C.45). The ae in the first line of Eq. (3.204) are the anomalous dimensions 
to first order of the DGLAP splitting functions introduced in Eq. (2.33). The functions 
K and K can be found in Appendix C.2. As already stated, the term Kpg. related to 
the choice of factorization scheme vanishes for the choice of the minimal subtraction 
scheme, the one made in this book, Ayzg(€) = 0. The first-order splitting kernel Po) 
occurring in the P-operator has been given in Eq. (2.33). 

Note the occurrence of the sums over final-state particle {ij} in both the K and 
the P operator. While in the former case these sums emerge from dipoles where the 
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initial-state particle a acts as spectator and the final-state particle {ij} is the splitter, 
as indicated by the colour factors, these roles are reversed in the latter case. There, in 
the P operators in addition a term emerges which is due to the initial-state particle a 
being the splitter and the other initial-state particle b being the spectator. Of course, 
if the corresponding dipoles do not emerge, due to the lack of coloured particles in 
either initial or final state, the respective terms are just dropped. 


3.3.3.5 Example: W production at NLO with Catani—Seymour 
subtraction 


In the following, only the real-emission part with a gluon in the final state will be 
considered, and of course, only the corresponding integrated terms need to be taken 
into account then. Starting with the real-emission contribution R(®p) and the corre- 
sponding subtraction term in four dimensions S(®z)°=°), the subtracted real-emission 
contribution reads 


doF-5) = dög [Ror = Sar) i (3.206) 


As before, cf. Eq. (3.134), the real contribution can be written as 


ota) aa | 
(3.207) 


where, again, the dimensionless quantity x = m#,/8 has been employed and where 
the real phase-space element including PDFs and flux factors is given by 


aa sag 27C ras(urR) 
x 


1 
dr = 57 dtudeg fusns (Lu, MP) Fang (ta HF) dPwg (3.208) 


with the final-state phase-space element for the {W g} system, d®wg, given in Eq. (3.139). 
At the same time, including averaging over incoming spins, the subtraction cross- 
section is constructed as 


do) = ia” Mge i 
4 


ud>Wt+ 


11 Dp. in ee eG 
| yroa oes yiu —% (3.209) 


2PuPg Tg, ud Tho 2P3Pg Tg, du Tig 


Using the Mandelstam identity Eq. (2.100) for this process, the recoil parameter £, „ā 
in the dipole of Eq. (3.195) is given by 


= PuPa—Po(Putpg)  ê+ê+û 
gud - = 
PuPa S 
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U> a 


To au ET (3.210) 


This is identical to the x of Eq. (3.133), in the first discussion of subtraction with 
dipole terms from a suitable ad-hoc definition. In Catani-Seymour subtraction the 
splitting kernel V49:4 in four dimensions, i.e. € = 0, reads 
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(3.212) 
This result is exactly the sum of the terms D(t, x) and D(é, x) from Eq. (3.136). Thus, 
for the subtracted real correction contribution one finally arrives at 
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This constitutes a perfectly finite result, as anticipated. 

Turning now to the virtual contribution for the ud —> W process and invoking 
Eq. (3.203) for the integrated dipole terms shows that the only relevant contributions 
to I(e) are those with both splitter and spectator in the initial state. Therefore 
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where it has been realized that, at Born-level, the centre-of-mass energy of the annihi- 
lating quark pair equals the W mass, consequently allowing the identification § = mj). 

Finally, the collinear subtraction terms from Eq. (3.202) have to be added. Speci- 
fying to the ud => Wg case only, so ignoring gluon-initiated processes, they are given 
by 
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The K and P operators can be obtained from Eqs. (3.204) and (3.205) with the 
contributing functions for the case at hand given by 


(e É) - 0 +e)log+Z* + 0-9 - 50-9 (5 | 
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Not surprisingly, the sum over all particles contained inside these terms collapses to 
the other initial-state particle only. Concentrating on the first term for the time being, 
and making the dependence on the PDFs explicit, therefore, 
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Here, as before, the colour insertion term T,,-Tz/T2 = —1 has already been evaluated 
and replaced. 

Together with the virtual contribution and the subtraction term I[,,7_,yw(e) this 
results in the following contribution to the overall cross-section: 
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The double and single poles 1/e? and 1/e cancel, which allows the prefactor to be 


replaced by unity, aac 
ER 
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Therefore the total contribution to the cross-section from these terms is, 
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Not surprisingly, this agrees with results that have previously been constructed in Sec- 
tions 2.2.5 and 3.3.2. Here, however, the full result is quoted, including the convolution 
with the PDFs. 


3.3.3.6 Practical aspects of the evaluation 


In the previous sections, the Catani-Seymour dipole subtraction has been introduced 
as a specific implementation of the more general idea of infrared subtraction. This 
method adds and subtracts terms to the virtual and real corrections in such a way 
that the resulting subtracted contributions are individually infrared-finite. Ultimately, 
this allows their phase-space integration with Monte Carlo techniques, which is neces- 
sary due to the complex structure of the high-dimensional integration region in such 
calculations. 
In other words, in 


gto fass 8, (das ues + Vn(®g; ur, uR) + TS) (®g; ur, uR) 


+ faor Ra (dei testn) ~ Salas tesin)| 
(3.220) 


the two integrals over d®g and d®R are individually finite and can therefore be inte- 
grated with the fairly evolved and efficient methods discussed in Section 3.2.2. 

There are two caveats, which have a purely technical reason. First of all, the 
collinear terms encountered in the do‘) contribution of Eq. (3.202) feature “+” func- 
tions. They are introduced in order to regulate possible divergences in the integration 
in the limit where the variable approaches 1. Looking at their definition in Eq. (2.10), 
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where g(z) is a regular function, this necessitates the inclusion of such terms — typ- 
ically the respective Born-level cross-sections — in the integration. This, however, is 
not difficult per se but merely a slight annoyance in actual implementations. The only 
tricky issue there consists in the fact that for z — 1 the divergence in f(z) is countered 
by the simultaneous vanishing of [g(z) — g(1)], which numerically is not always perfect 
due to the limited accuracy of numerical methods. One way to account for this is to set 
a numerically very small limit for the regulator [g(z) — g(1)] below which it is replaced 
by exactly zero, thereby setting the full integrand to zero.’ 

A second problem emerges in the subtracted real contribution, for similar reasons. 
By construction [R(®pz) — S(®z)| approaches zero in singular regions of the phase 
space r. This cancellation, however, is a cancellation of the type oo — oo, which 
notoriously is tricky to achieve numerically. It has therefore become customary to 
set the difference above to exactly zero, when two four-momenta in ®g become very 
collinear or one of the momenta becomes very soft. In reality this means that scalar 
products of the four-momenta are being checked, and if they fall below some very small 
cut-off value a, [R(®rz)—S(PR)] will be replaced by zero. The name of the game then 
is to optimize the choice of a in such a way that the overall result is numerically stable. 

Both these issues have been discussed and tested quite exhaustively in a number 
of automated implementations [536, 537, 584, 617]. 


3.3.3.7 Taming the growing number of dipole terms 


The two most popular subtraction methods, the algorithm of Frixione, Kunszt, and 
Signer (FKS) [539, 542] and the formalism of Catani and Seymour (CS) [344, 353], 
exhibit a different behaviour in the scaling of the number of subtraction terms Nsub 
with the number of external particles n. While, naively, the former method demands 
three subtraction terms per external particle, Ney, « 3”, the latter method requires 
two dipoles per pair of particles. Thus, in this case, Neup x 3"+)/2,. While this 
does not play any significant role for processes with only few external particles, n < 6 
or so, it becomes of course more pronounced with the number of external particles 
increasing. 

One way to handle this is to realize that the FKS method bases the construction of 
subtraction terms on a decomposition of the additional particle’s emission phase space 
in soft, collinear, and soft-collinear regions, with one term per region. Conversely, in 
non-singular regions of the phase space, no subtraction term is being constructed. In 
contrast, the dipoles of the CS method are always constructed, and the subtraction 
is performed over all phase space. This of course can be changed in such a way, that 
phase-space criteria are defined which allow to identify potentially dangerous, singular, 
regions of phase space based on the dipole kinematics. Then, in uncritical regions, no 
subtraction is being performed. One particular way is to define the criterion through 


9A typical value such a cut-off would be O (10-°), where 6 is the number of significant digits up 
to which the numerics of the program are stable; for double precision this is typically 6 > 10. 
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some Qdip, which parameterizes the phase space. For instance, for FF dipoles, the 
“constrained” dipoles are given by [781] 


‘gk = Dijk O(a — Yijik), (3.222) 


where yij;k has been defined in Eq. (3.161) and parameterizes the soft and collinear 
limits of the splitting, as can be seen in Eq. (3.166). This idea has been extended 
to other dipole configurations, namely to II, IF, and FI dipoles in [777], see also 
Appendix C.2. Of course, the integrated contributions, the I, K, and P operators, 
inherit this dependence, leading to some a dependent additional terms. 

This parameter-dependent identification of singular regions and the ensuing decom- 
position into subtracted and unsubtracted phase-space regions lead to a potentially 
large saving in the number of terms. In addition, it also allows a very convenient check, 
as the overall results for cross-sections must be parameter-independent. 


3.3.4 Next-to-leading order tools 


The methods presented in Section 3.3.1, to calculate one-loop amplitudes have been 
used to construct libraries of NLO calculations or even tools for their automated 
calculation. These tools typically need to be interfaced with other tools, which take 
care of the real correction contribution, the treatment of infrared singularities, phase- 
space integration, and the leading order part of the calculation. The latter two of 
these tasks — and quite often also the first two — are conveniently handled by tools 
presented in Table 3.2 in Section 3.2.3. The interface between these two classes of tools 
has been standardized in the Binoth—Les Houches accord [138, 250]. 

In Table 3.3, publicly available tools for the evaluation of one-loop amplitudes are 
listed and roughly categorized. This comprises 


e their scope: evaluating integrals (“integral evaluator” ), reducing the integrands to 
integrals (“integral reduction” ), acting as a library of NLO or one-loop calculations 
(“library”), or automated creation and calculation of one-loop amplitudes (“one- 
loop generator” ) 

e the way the amplitudes are being reduced and evaluated: Passarino—Veltman 
(“PV”), Ossola—Pittau-Papadopoulos (“OPP”), or OPENLOoPS (“OL”) reduc- 
tion. 


These tools are often supplemented with other, automated programs, that take 
care of the infrared subtraction of the real-emission corrections. These are listed in 
Table 3.4. Most of them are also capable of providing the real-emission matrix elements, 
taking care of the phase-space integration, etc. 


3.3.5 Next-to-leading order practicalities 


Once a calculation at NLO, or even higher order, has been completed, it has its greatest 
utility if it is available in a form in which a non-author can produce results for a specific 
kinematic configuration, or choice of scales, choice of PDFs, etc. Often the calculation is 
available in a program publicly available; there are also collections of such calculations, 
as for example are available in MCFM [311]. In some cases, especially for complex final 
states, the program may not be available per se but the information needed to provide 
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Table 3.3 Tools for calculating virtual corrections: there are different types of relevant tools, 
including those that list scalar (and other) basic integrals (“integrals”), reduce integrands 
(“reduction”), provide libraries of virtual corrections or the full cross-sections (“libraries”), 
or generate one-loop amplitudes automatically (“generators”). They use different reduction 
technologies, such as Passarino—Veltman (“PV”), Ossola—Pittau—Papadopoulos (“OPP”), or 
OPENLOoPS (“OL”) reduction. Some of them are set up to directly produce full calculations 
(“full”) including all contributions. 


type technology 
dependencies on other codes 
LOOPTOOLS [605] || integrals 
ONELOOP [879] integrals 
QCDLOoop [499] integrals 
COLLIER [457] reduction 
CUTTOOLS [793] reduction OPP 
FORMCALC [605] reduction PV 
NINJA [802] reduction Laurent expansion 
SAMURAI [756] reduction 
BLACKHAT [226] library (amplitudes) OPP (unitarity) 
MCcEM [309] library (full calculation) PV & OPP 
NJET [181] library (amplitudes) OPP 
GOSAM [420] generator (amplitudes) OPP 
SAMURAI +NINJA + 
MADLOoop [625] generator (full calculation) | OL+OPP 
CuTTOOLS + 
OPENLOOPS [338] || generator (amplitudes) OL+OPP 
COLLIER +CUTTOOLS + 
HELAC-NLO [241] || generator (full calculation) | OPP 
CuTTOOLS + 


a specific prediction is available in ROOT-ntuple form, where each event stores a 
phase-space point, along with the matrix elements and other information, such as 
the four-momenta of all final-state particles. At NLO, events are generated separately 
for each of four types of contributions: Born, virtual, integrated-subtraction, and real 
emission.!° A positive-definite physical cross-section is only formed by the addition 
of many events of all four types. The ntuples can be used with an analysis script 
to allow for the construction of cross-sections with the appropriate kinematic cuts 
corresponding to a given experimental analysis. 

For a process containing n partons at Born level, the emission of a real particle 
results in a (n+1) parton phase space. The subtracted real-emission events correspond 
to regions of this phase space where two massless partons become collinear or soft. 
These divergent regions have been regularized by a subtraction procedure, often the 


10 At higher multiplicities, a further sub-division may be optimal. 
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Table 3.4 Tools for automated infrared subtraction: there are different types of relevant 
tools, those including the full underlying Born-level amplitudes and tools for phase-space 
integration in the construction of the subtraction terms and those tools which only provide 
the differential and integrated subtraction terms to be added on top of matrix elements 
provided from outside. 


type technology 
dependencies on other codes 
AMEGIC++ [584] full: ME+PS CS dipoles 
SHERPA 
AUTODIPOLE |617] subtraction only | CS dipoles 
MADGRAPH 
COMIX [582] full: ME+PS CS dipoles 
SHERPA 
MADDIPOLE [537, 538] || full: ME+PS CS dipoles 
MADGRAPH 
MADFKS [536] full: ME+PS FKS subtraction 
MADGRAPH 
MATCHBOX [808] subtraction only | CS subtraction 


Catani—Seymour subtraction scheme discussed in Section 3.3.3. For this type of con- 
tribution, one event corresponds to the original emission, while the rest correspond to 
the subtraction terms. The dipole terms, evaluated in n-parton phase space, are then 
added back in the integrated subtraction events. For complex final states, there can 
be many Catani-Seymour subtraction terms and thus the need for many subtracted 
real-emission events in the ROOT ntuple. In fact, the subtracted real events take the 
majority of the disk space required for storing the ROOT-ntuples. The statistical un- 
certainty for events from the Born, virtual, and integrated subtraction ntuples can 
be calculated in the standard Monte Carlo way. As the real-emission and the corre- 
sponding subtraction configurations for a given event are strongly anti-corrrelated, the 
anti-correlation must be taken into account in order not to over-estimate the statistical 
error. It is possible to use a number of different jet algorithms and sizes, as long as 
the appropriate subtracted real counter-events are present. Typically, a wide range of 
jet sizes, R € [0.2, 1], can be allowed with only a small overhead (number of additional 
subtracted real events needed). 

As mentioned above, the partons in each event can be re-clustered on the fly to 
form cross-sections for different jet algorithms, for example using FASTJET. The ap- 
propriate weight information is stored for each event, allowing the matrix element for 
that event to be reweighted for different PDFs (and the appropriate value of a;(mz) 
for that PDF), and for different values of the renormalization and factorization scales. 
Thus, the PDF, a,(mz) and scale uncertainties can be automatically calculated in 
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one run over the ntuples. For the Born and subtracted real events, this reweighting 
is relatively straightforward. The virtual events have an additional dependence on 
the renormalization scale resulting from the one-loop amplitudes. The integrated sub- 
traction events also depend on the factorization scale and the PDFs as a result of 
off-diagonal splittings. 

There is a standard format for storing information from NLO calculations in ROOT 
ntuples that was originally developed by the BLACKHAT +SHERPA collaboration [233], 
but has now been adopted for use in a number of NLO calculations from other groups 
as well [182, 420]. Thus, an analysis code developed for the evaluation of ROOT ntuple 
events for one process, can easily be adapted for use with any other NLO process. 

Consider the production of Higgs (+ > 3) jets through gg fusion at NLO as an ex- 
ample. This calculation has been performed by the GOSAM collaboration [420] and the 
output has been made available in the BLACKHAT +SHERPA ROOT ntuple format.!! 
This is one of the most difficult NLO calculations carried out to-date. For cross-section 
predictions for reasonable statistical uncertainties at 13 TeV, approximately 70 GB 
of disk storage is needed for the Born ntuples, 2.5 GB for the virtual ntuples, 130 
GB for the integrated subtraction ntuples and 2.2 TB for the subtracted real ntuples. 
As discussed earlier, the subtracted real events by far require the most storage space. 
Although the total disk space required is large, each ntuple file is restricted to a size 
of a few GB, allowing for the option of parallel processing. 

In Fig. 3.8 (left), the cross-section for the production of a third jet for gg > H+ > 3 
jets at 13 TeV is shown as a function of the third jet transverse momentum. Jets are 
clustered using the anti-kr (D = 0.4) jet algorithm and must have a minimum trans- 
verse momentum of 30 GeV and a maximum absolute rapidity of 4.4. The prediction 
in the ntuple uses the CT10 NLO PDFs and the calculation is carried out at a central 
scale of up = urp = Hy /2. The reweighting information in the ntuples was used to 
produce the PDF uncertainty band (using the CT10 Hessian error set) and the scale 
uncertainty band (varying the renormalization and factorization scales independently 
up and down by a factor of two, while keeping the difference between the two scales 
within a factor of two). The scale uncertainty dominates except at high pr where the 
PDF uncertainty becomes comparable. 

Fig. 3.8 (right) shows the jet mass distribution for the third jet, calculated from 
the same ntuples. The scale dependence is significantly larger than for the jet pr 
distribution since the jet mass, in this context, is a leading-order quantity, non-zero 
only when an additional gluon is emitted. While the high jet mass behaviour shown 
in the figure is reasonable, the low mass region lacks the Sudakov suppression that 
would be present in a resummation calculation or parton shower Monte Carlo. Similar 
distributions can be calculated for the jet mass distributions for the leading and second 
leading jets, with the same caveat. 


11Qne of the authors (JH) is grateful to the GOSAM collaboration for providing and assisting in the 
use of these ntuples. 
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Fig. 3.8 The third jet transverse momentum distribution (left) and third 
jet mass distribution (right), for Higgs + > 3 jets, calculated at NLO using 
the GOSAM ROOT ntuples. The predictions are shown with scale and PDF 
uncertainties. 


3.4 Beyond next-to-leading order in QCD 
3.4.1 Next-to-next-to-leading order 


Before discussing some of the technical aspects of calculations at NNLO, it is useful to 
review the ingredients that must enter such a calculation. Consider a computation of 
the cross-section for the the production of a Z boson and a jet, which has already been 
discussed at some length at NLO. At NLO this calculation involves one-loop diagrams 
and real radiation diagrams, with some mechanism for cancelling the divergences that 


are present in each contribution. What is the corresponding situation at NNLO? 


One way to understand the different contributions is to consider all possible distinct 
cuts of the O(a?) four-loop diagram shown in Fig. 3.9. In total there are four types of 
contribution, corresponding to cuts (a)—(d) in the diagram. They can be understood 


as follows. 


(a) The first contribution corresponds to an interference involving a two-loop diagram 


on one side of the cut and can be written schematically as 


Re [AVP (Zaqqg) x A" (Zqgg)"] - (3.223) 


In the past this type of contribution had been the focus of the most attention 


since the evaluation of two-loop amplitudes is highly non-trivial. 


(b) This contribution corresponds to the square of the one-loop three-parton matrix 


elements, 
1—loop _ \/2 
| At "°°? (Zaqg)| 


(3.224) 


Note that these are the same amplitudes that appear in the NLO calculation, 
although in that case they enter as an interference with the tree-level amplitude. 


(c) The third contribution also contains one-loop matrix elements, this time with four 


partons, and enters interfered with the corresponding tree-level amplitude, 


Re [Al !°P (Zaqgg) x A™°*(Zqqgq)"] - (3.225) 
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Fig. 3.9 A four-loop diagram representing NNLO contributions to the 
Z+jet cross-section (the Z boson is shown as an external wavy line). The 
diagram should be cut in all possible ways, shown by the dashed lines 
(a)-(d), with the cut contributions described in the text. 


Notice that exactly this contribution would be present in the NLO calculation of 
Z +2 jet production. The difference in this case is that one of the partons may 
be unresolved, leading to additional soft and collinear singularities. 


(d) The final contribution involves only tree-level matrix elements that contain five 
partons, 


|4 (Zqgggg)|" - (3.226) 


In this case two partons may be unresolved, giving rise to singularities of a new 
form than encountered at NLO. Isolating these has provided the toughest chal- 
lenge to completing such NNLO calculations. 


Any NNLO calculation will therefore be substantially more complicated than its 
NLO counterpart and involve the introduction of new techniques. Unlike at NLO, 
at present there is no automatic procedure for generating predictions at this level 
and calculations are currently being performed on a case-by-case basis. It is therefore 
important to understand the benefits of having a NNLO calculation at hand in each 
case. Of course, by extending the perturbative calculation by an additional order, one 
expects that the quality of the prediction should improve. Certainly more effects can 
be accounted for at this order in perturbation theory. For instance, contribution (d) 
allows a single jet to be composed of three partons, a situation that is impossible at 
NLO. Such configurations are more sensitive to details of the jet algorithm that may 
be reflected in real data. In addition, as already discussed in Section 2.2.6, the scale 
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Fig. 3.10 Predictions for the rapidity distribution of an on-shell Z boson 
in Run II at the TEVATRON. The bands indicate the scale uncertainty of the 
LO, NLO and NNLO predictions, when renormalization and factorization 
scales are varied within the range Mz/2 to 2Mz. 


uncertainty is expected to be further reduced at NNLO compared to NLO. Finally, 
in cases where NLO corrections are large, it is a chance to check the convergence 
of the perturbative expansion and, hopefully, regain good theoretical control of the 
prediction. Of all of the motivations, the issue of paramount importance is obtaining 
a theoretical prediction with as small an uncertainty as possible in order to extract 
information about fundamental parameters of the Standard Model. 

Most of these factors are exhibited by the calculation of the Z rapidity spectrum 
at the TEVATRON up to NNLO [154]. As shown in Fig. 3.10 the normalization and the 
shape of this distribution changes significantly between LO and NLO. However the 
inclusion of NNLO corrections does little to further alter the shape and the overall 
size of the correction amounts to only a few per cent. Finally, the scale uncertainty 
which is represented by the width of the bands in the figure, is greatly reduced with 
each successive order. Of course, the improved theoretical prediction for the Drell-Yan 
process is essential for calibrating many measurements at the TEVATRON and the LHC, 
cf. Chapters 8 and 9 

Returning to the elements of a NNLO calculation, one of the most complicated 
aspects is the evaluation of the two-loop diagrams. One reason for this is that the 
integrals themselves have a much richer analytic structure and, importantly, a higher 
degree of divergence. At one-loop the singularities occur when the momentum flowing 
through the loop puts one of the internal particles on its mass shell and, in dimensional 
regularization, this results in poles as deep as 1/e?. For two-loop amplitudes there are 
two such unconstrained momenta in the loop, so that the leading pole can be as high 
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as 1/e*. Furthermore, at one loop every amplitude can be expressed in terms of a 
basis of scalar integrals containing up to four propagators, cf. Eq. (3.103). At the 
two-loop level no such basis is known. Instead the usual procedure for performing 
such a calculation is as follows. All of the integrals appearing in the calculation of 
the amplitude are collected and reduced to a set of master integrals, to which all 
the others can be straightforwardly related. This reduction usually proceeds through 
the Laporta algorithm [721] which takes advantage of integration-by-parts [393] 
and Lorentz-invariance [559] identities and symmetry relations. A number of tools for 
performing this reduction, through variants of this algorithm, are commonly available. 
At that point the remaining work lies in evaluating the master integrals. For the most 
complicated cases, the current method of choice is to solve a system of differential 
equations and match to known solutions in particular limits. At present most of the 
important amplitudes for 2 — 1 and 2 — 2 processes are known, with the most 
complicated being those relevant for pair production of massive gauge bosons. However, 
relatively little is known about amplitudes for 2 — 3 processes and beyond. 

With the two-loop contribution in hand, the remaining difficulty — which is very 
significant indeed — is to isolate and cancel the various singularities that enter all of 
the contributions. For contributions of type (c), corresponding to one-loop amplitudes 
in which an external particle may become unresolved, the form of the singularities and 
methods for handling them have been known for some time (see for instance Ref. [184] 
and references therein). However, a general algorithm, along the lines of the Catani- 
Seymour dipole subtraction method developed for NLO, has not yet been formulated. 
The elements of the calculation that are hardest to handle correspond to contribution 
(d), when two of the partons are unresolved. The factorization of both the amplitudes 
and the phase space in these limits is more complicated than in the single-unresolved 
case. For instance, except in special cases the factorization of amplitudes does not 
involve a product of two Altarelli—Parisi splitting functions and instead new functions 
must be introduced to describe all limits [319, 349]. 

So far, a number of techniques have been successfully applied in order to isolate 
and cancel the singularities appearing in NNLO calculations. The method of sec- 
tor decomposition has been used in the calculation of corrections to the Drell-Yan 
process [159] and to Higgs production [160]. In this approach a special factorization 
of the phase space is used to ensure that all singularities can be extracted analyti- 
cally by decomposing in terms of simple plus distributions, as in Eq. (A.7). A similar 
approach, termed sector-improved residue subtraction [425], has been used to 
compute NNLO corrections to top pair production [427] and the all-gluonic Higgs+jet 
channel [270]. An alternative approach, based on well-known properties of transverse 
momentum distributions in simple processes, is called gr subtraction [350]. It has 
been applied to Higgs production [594], vector boson production [341], associated 
WH and ZH production [524], photon pair production [399] and top pair produc- 
tion [265]. A closely related method is jettiness subtraction [274, 555] which uses 
ideas from soft-collinear effective theory (SCET) to isolate and cancel infrared 
singularities. It has been used to compute a number of 2 — 2 processes at NNLO 
such as Higgs+jet [271], W+jet [274], and Z+jet production [269]. Last, the method 
of antenna subtraction [563] has been used to compute NNLO corrections to dijet 
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production through gluon fusion [566] and Higgs+jet [392] and Z+jet production [564]. 
This approach is most similar to the subtraction methods discussed at NLO above, 
with counter-terms constructed that can render individual contributions finite but 
that can also be analytically integrated to explicitly collect singularities. For example, 
returning to the example above, the schematic form of the NNLO contribution to the 
cross-section for the parton-level process i + j + Z-++parton would be 


daNNLO _ J di [ag + 6) — 6G] + f da [da — a?) + j ds [aay — 66°] . 


In these equations the superscripts label the different contributions indicated in Fig. 3.9. 
Each contribution is integrated over the appropriate phase space for n final-state par- 
ticles, ®,. The counter-terms for each contribution are indicated by os. In this for- 
mulation the counter-term C3, for instance, removes all singularities resulting from 
single- and double-unresolved limits of the contribution from the matrix elements (d). 

Of course, not all calculations at NNLO require the full machinery discussed here. 
In particular, for the simplest 2 — 1 and 2 — 2 processes, particularly if one is only 
interested in total cross-sections and not exclusive properties of the final-state particles, 
the calculations can be performed more simply. In fact, for the most important such 
cases NNLO results have been available for some time. The total inclusive cross-section 
for the Drell-Yan process, production of a lepton pair by a W or Z in a hadronic 
collision, has long been known to NNLO accuracy [607]. Similarly, the inclusive Higgs 
boson cross-section, which is a one-scale problem in the limit of large top mass, was 
also first computed at NNLO some time ago [158, 615]. 

To conclude, the frontier of NNLO calculations is currently evolving very rapidly. 
There are many competing methods for performing the calculations and undoubtedly 
these techniques will continue to be honed. At present, 2 — 2 reactions can be com- 
fortably handled, even with coloured and massive objects in the final state. These 
pioneering calculations will lead to an even greater availability of NNLO predictions 
in the near future. An important final note is that, for a consistent description of 
the entire hard process at NNLO, all of the calculations discussed above rely on the 
availability of parton densities evolved at the same order. Such PDF sets are indeed 
available, as will be discussed in detail in Chapter 6, thanks to the calculation of the 
QCD three-loop splitting functions [767, 883]. 


3.4.2 Approximate NNLO: LoopSim 


Rather than computing exact NNLO corrections it may instead be useful to con- 
sider an approximation of the full result that captures some of the most important 
effects that enter at that order. One such approximation is provided by the LoopSim 
method [830], that was developed to handle observables that receive significant higher 
order corrections resulting from new channels or topologies (cf. Section 2.2.7). 

In this method, contributions (b), (c), and (d) are treated exactly. Only the two- 
loop contribution (a) is approximated, although its singular behaviour is of course 
known and can thus be included in its exact form. Given a set of input momenta 
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Fig. 3.11 Comparison between the approximate NNLO result from Loop- 
Sim (blue) and the exact result from DYNNLO (red). The corresponding 
NLO result is shown in green. Reprinted with permission from Ref. [830]. 


that correspond to the real radiation contributions entering at NNLO, the method 
finds a corresponding Born configuration and sequence of subsequent emissions. This 
step can be performed using a sequential-recombination jet algorithm, for which the 
Cambridge—Aachen algorithm is preferred. By considering all possible ways in which 
emitted particles can be combined with emitters, LoopSim determines exactly the 
singular (or logarithmic) contributions of loop diagrams, which unitarize the corre- 
sponding singular terms in the real radiation diagrams.'? This approximate NNLO 
prediction is then finite and differs from the full NNLO calculation only by constant 
terms. Indeed, the expected precision of the method for an observable A is, 


do NNLO NNLO 2 
PLoopSim _ de 1+0 Os (3.228) 
dA dA KNNLO(A) 


where doNN¥9 /dA = KNN4°(A) dozto/dA defines the “local” NNLO K factor as a 
function of A. As KNN4°(A) increases, the quality of the approximation is expected 
to dramatically improve. In cases where this can be explicitly tested, such as the Drell- 
Yan process, this expectation is confirmed, as shown in Fig. 3.11. Comparisons of the 
predictions of this method with LHC data representing more complicated final states 
will be discussed in Chapter 9. 


12As such, this method bears an interesting similarity to multijet merging methods for matrix 
elements and parton showers that will be discussed in Chapter 5. At present this connection has not 
been fully explored. 
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3.4.3 Beyond NNLO 


Since NNLO predictions for hadron collider observables are only just becoming more 
widespread, the prospects for extending these to even higher orders in perturbation 
theory are hard to assess. However, given the complexity of any such calculation, there 
must be a very strong need for an improved theoretical prediction. The case of the 
Higgs cross-section is a strong candidate for such a calculation since this knowledge 
could be put to use immediately in trying to unravel the properties, such as coupling 
strengths, of the Higgs boson. This calculation is also among the most simple that 
could be conceived at a hadron collider, since it is a 2 — 1 calculation involving a 
scalar particle. 

For this reason there has already been a remarkable amount of progress on comput- 
ing the Higgs boson cross-section at N?LO. The first partial results for this calculation 
were presented in Ref. [155], where the calculation is performed as an expansion about 
the threshold region. Specifically, the partonic cross-section is expanded as, 


Gig (m3, 8) x 5 (=) Po), (3.229) 


k=0 


where z = m?;/8 so that (1 — z) is the variable that parameterizes the distance from 
threshold. The results for the first term in the expansion of n?)(z) are presented in 
Ref. [155], neglecting terms of order (1 — z). The same technique is of course also 
applicable to the Drell-Yan process, where a similar level of approximation has also 
already been applied [129]. The phenomenological impact of these results is ambiguous, 
due to the fact that there can be substantial differences resulting from equivalent 
parameterizations of the neglected terms. Nevertheless, the same methodology has very 
recently been extended to compute the sub-leading terms to an arbitrary level [157]. 
This effectively results in a full N°LO calculation, paving the way for a similar level 
of precision for a range of inclusive cross-sections. 


3.4.4 Detour: Electroweak corrections 


When making theoretical predictions for a hadron collider it is natural to consider a 
perturbative expansion in the strong coupling, as described so far. The hard scattering 
must involve strongly interacting particles and the expansion parameter as is an order 
of magnitude larger than the corresponding electroweak coupling, ay. Nevertheless, 
one may still be interested in the effect of electroweak corrections in a number of 
circumstances. 

First of all, simply estimating the size of the corrections from the numerical values 
of the couplings, one might expect that, since ay ~ a2, the effect of NLO EW cor- 
rections should be considered at the same time as NNLO QCD, when the precision of 
the theoretical precision is paramount. A further motivation, that has received con- 
siderable attention in recent years, is due to the expected nature of electroweak effects 
at high energies. As in QCD, the emission of gauge bosons in the electroweak theory 
can be described by an eikonal factor. The crucial difference is that, when integrated 
over phase space, the mass of the W and Z bosons provides a cut-off for the integral 
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so that it does not diverge. As a result the eikonal approximation results in the EW 
Sudakov factor, 


& Qw 2 S x 

CEW real Y Jr log (=) T0. (3.230) 
where s is the hard scale at which the process is being probed, cf. the QCD equivalent 
in Chapter 2. There is a corresponding virtual contribution, generated by one-loop 
diagrams in which a W or Z boson is exchanged internally, and just as in the QCD 
case this enters with the opposite sign, 


a s 
CRW virtual ~ S log” Pa Go. (3.231) 
However, there is a crucial difference between the QCD and electroweak cases. The Su- 
dakov factor is associated with a combination of isospin generators and, for a fixed ini- 
tial state, there is a mismatch when the forms appearing in Eq. (3.230) and Eq. (3.231) 
are promoted to exact relations [866]. This violation of the Bloch—Nordsieck theorem is 
due to electroweak symmetry breaking and the fact that the initial states present in a 
given process are not averaged over, but weighted by different PDFs. In addition, and 
perhaps even more importantly for the interpretation of collider data, it is of course 
straightforward to isolate the effects of EW radiation in data — unlike the case of QCD 
radiation. The lack of infrared divergences means that it is customary that events that 
are identified as containing additional W or Z bosons form a separate data sample. 
Of course, there are regions where W and Z radiation escapes detection and some 
effect from these contributions can partially cancel the virtual corrections, depending 
on the particular process and the experimental setup [206]. However, the net effect of 
real EW radiation is typically rather small and therefore the virtual corrections have 
been the subject of the most intense theoretical scrutiny. 

Note that the form of the Sudakov correction in Eq. (3.231), that is enhanced by 
two powers of the logarithm, multiplies the leading-order amplitude. This means that 
it is easy to estimate the size of the leading relative EW corrections for the simplest 


processes, 
EW virtual 
SEW -Z z = — (constant) T log? (=) ; (3.232) 


This approximation is rather crude since it assumes that all relevant kinematic scales 
are large and can be approximated by a single value s, for instance for a 2 + 2 process 
s & |t| ~ |u|. The constant that appears in Eq. (3.232) is a combination of isospin 
Casimirs and depends on the identities of the particles participating in the reaction. It 
can be written in terms of electroweak parameters such as the weak mixing angle and is 
usually of order unity. To illustrate the size of the correction that can be anticipated, 
Fig. 3.12 shows the value of SEW obtained from Eq. (3.232) as a function of v's, 
with the constant set to 1. In reality this constant varies for each sub-process and the 
large negative contribution is partially mitigated by collinear (single) logarithms, but 
Fig. 3.12 gives a good guide to the size of the corrections that can be expected. With 
corrections of order 10% or larger for scales of 1 TeV and beyond, in regions of large 
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Fig. 3.13 Sample box diagrams entering the NLO EW corrections to the 
process qq — tt. 


invariant mass or transverse momentum it is possible that the EW corrections can be 
as large as their QCD counterparts. 

A complication is that the consistent inclusion of EW effects often means the cal- 
culation of a wider set of contributions that must be handled with care. As a concrete 
example, consider the reaction qq — tt that proceeds at O(as) through a single di- 
agram with an s-channel gluon. The virtual electroweak corrections to this process 
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include a number of self-energy, vertex, and box diagram contributions. One of the 
contributing box diagram contributions is shown in panel (a) of Fig. 3.13. It consists of 
the tree-level O(as) process interfered with the O (asaw) one-loop correction involving 
an internal Z-boson in the loop. However, at the same order one must also consider the 
interference of the diagrams shown in panel (b): the tree-level O(a,,) process and the 
one-loop O(a?) box. In this way the weak and strong amplitudes become entangled 
when computing EW corrections if a given final state may be obtained at tree-level 
through both strong and weak interactions. In fact the inclusion of all the diagrams in 
Fig. 3.13 results in an infrared-divergent contribution that must be cancelled by the 
radiation of real gluons. These diagrams correspond to dressing the tree-level ones in 
Fig. 3.13 with a gluon, to obtain O(as,/a,) and O(aw/a,) amplitudes, and perform- 
ing the interference. Singularities can be extracted and cancelled against the virtual 
contributions using the same methods, such as dipole subtraction, that are applied in 
the pure QCD case [704]. 

Although the motivation for including EW corrections has been presented in terms 
of Sudakov logarithms, for the simplest processes it is possible to compute the correc- 
tions exactly, to include all sub-leading effects. In this way EW corrections have been 
obtained and investigated for most 2 — 2 processes, including the production of dijets, 
vector bosons and a jet, top pairs, and dibosons. A recent review [765] summarizes 
the most important results and contains references to the original calculations. Alter- 
natively, one can use the known factorization properties of the Sudakov logarithms to 
obtain an approximate form of the EW corrections [461], a strategy that can be used 
to provide the corrections for more complex final states [395]. 


3.5 Summary 


This chapter has discussed in some detail the essential elements involved in making 
perturbative predictions for hadron colliders. There has been rapid progress in this area 
in the years leading up to data-taking at the LHC, as the theoretical predictions have 
had to evolve to match the expected breadth and precision afforded by such a machine. 
The latest tools are able to provide NLO corrections for configurations involving many 
jets and NNLO predictions and beyond for the most important processes. In many 
cases even the effect of electroweak corrections can be included. 

Poised at the brink of yet another substantial jump in machine capability — not 
only an increase in energy, but also an unprecedented amount of data resulting from 
the increased luminosity — it is imperative that perturbative predictions continue to 
improve at a similar rate. The preparation of the “Les Houches wishlist” [294] has 
provided a forum for discussing the most useful calculations that could be performed, 
in order to ensure that such progress is achieved. By now, all of the NLO perturbative 
QCD calculations contained in the original list have been completed, primarily due 
to the emergence of the unitarity techniques discussed in Section 3.3.1. The latest 
iteration of the list [296] reflects the high-precision calculations that are expected 
to be required during the expected lifetime of the LHC snd typically demands the 
inclusion of both NNLO QCD and NLO EW effects. As an example, Table 3.5 shows 
the Higgs-related calculations for which a strong need is anticipated in the future. 
For each calculation presented in the table, Ref. [296] discusses the motivation and 
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degree of need, which is particularly important given the challenging nature of many 
of the demands. Note that such discussions are certainly not limited to the case of 
Higgs boson cross-sections and Ref. [296] contains wishlists for other Standard Model 
processes. 

As an example of the motivation, consider the Higgs + 2 jets final state. This 
channel is crucial in order to understand the Higgs boson coupling to vector bosons, 
through the vector boson fusion (VBF) channel. For the VBF channel the NNLO QCD 
corrections are known in a fully differential form, in the double-DIS approximation, 
and EW effects are known to NLO. However, the search for this production mode 
suffers from a background consisting of Higgs production through gluon fusion, when 
two additional hard jets are also radiated. Currently this channel is known only to 
NLO QCD in the infinite top mass approximation, and only to LO QCD when the 
full dependence on the top quark mass is retained. If both the VBF and gluon fusion 
Higgs + 2 jets cross-section are known to NNLO QCD, and NLO EW, accuracy then 
with 300fb~! of data it may be possible to measure the HWW coupling strength to 
the order of 5%. 

Having discussed the frontier of fixed-order, parton-level treatments, the following 
chapter will describe in more detail the application of these predictions to a wide 
range of hadron collider processes. A variety of alternative approaches that go beyond 
the ones presented so far will be discussed in Chapter 5. These represent all-orders 
treatments that either address regions where the calculations presented here break 
down, or which in addition allow predictions to be made at the hadron level for a 
direct comparison with data. 
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Table 3.5 The “Les Houches wishlist” for processes involving the Higgs boson, taken from 
Ref. [296]. 


Process Comments 

A State of the Art 

da @ NNLO QCD (expansion in 1/m:) 

full m:/m», dependence @ NLO QCD and @ NLO EW 
NNLO+PS, in the m; > co limit 

Desired 

da @ NNNLO QCD (infinite-m, limit) 

full m:/m», dependence @ NNLO QCD and @ NNLO QCD+EW 
NNLO+PS with finite top quark mass effects 

H +j State of the Art 

da @ NNLO QCD (g only) 

and finite-quark-mass effects @ LO QCD and LO EW 
Desired 

do @ NNLO QCD (infinite-m; limit) 

and finite-quark-mass effects @ NLO QCD and NLO EW 
H + 2j State of the Art 

Ttot (VBF) @ NNLO(DIS) QCD and do(VBF) @ NLO EW 
do(gg) @ NLO QCD (infinite-m, limit) 

and finite-quark-mass effects @ LO QCD 

Desired 

do (VBF) @ NNLO QCD + NLO EW 

do(gg) @ NNLO QCD (infinite-m, limit) 

and finite-quark-mass effects @ NLO QCD and NLO EW 
H +V State of the Art 

do @ NNLO QCD and do @ NLO EW 

Ctot(gg) @ NLO QCD (infinite-m, limit) 

Desired 

with H — bb @ same accuracy 

do(gg) @ NLO QCD with full m;:/m», dependence 

tH and State of the Art 

tH do(stable top) @ LO QCD 

Desired 

da(top decays) @ NLO QCD and NLO EW 

ttH State of the Art 

da(stable tops) @ NLO QCD 

Desired 

do (top decays) @ NLO QCD and NLO EW 

gg —> HH | State of the Art 

da @ NLO QCD (leading m, dependence) 

da @ NNLO QCD (infinite-m; limit) 

Desired 

do @ NLO QCD with full m:/my, dependence 


4 
QCD at Fixed Order: Processes 


In Chapter 2 the description of hadron collider processes at fixed order in pertu- 
bation theory was introduced and in Chapter 3 the technology for performing such 
calculations was reviewed. This chapter will discuss the application of these ideas 
to describing important Standard Model processes that can be probed at the LHC 
and in other hadron collider experiments. In particular, theoretical issues related to 
calculations for more complex final states will be addressed. The discussion of these 
processes begins with jet production in Section 4.1. This is followed by a discussion of 
processes that are, in a theoretical sense, closely related: the production of photons in 
association with jets (Section 4.2). The extension to the production of gauge bosons 
plus, potentially, some jets is the focus of Section 4.3. Other processes considered in- 
clude the production of pairs of electroweak bosons (Section 4.4), top pair production 
(Section 4.5) and single-top processes (Section 4.6). A selection of processes that are 
more rare than these are briefly discussed in Section 4.7. The chapter concludes in 
Section 4.8 with an overview of the main channels by which a Higgs boson is pro- 
duced and observed in the LHC experiments. These processes will remain central to 
the continuing LHC program for the foreseeable future. 

A more in-depth comparison of these fixed order results with data will be presented 
in later chapters, cf. Chapters 8 and 9. 


4.1 Production of jets 


In the previous chapter, an outline of the perturbative approach was introduced by 
considering the production of a lepton and neutrino through the weak interaction. A 
far more likely outcome of a collision is the production of a final state through the 
strong interaction, which in the partonic picture is represented by quarks and gluons. 
In contrast to the simplest examples discussed previously, in this case even at LO there 
are many contributions corresponding to various combinations of partons in both the 
initial and final states. Therefore the two-jet cross-section is given by (cf. Eq. (2.52)), 


dza dx 
O2-jet = Z5 e T fajta (Za, LE) fo/no( Tb, HF ) faan IMab—seal”, (4.1) 


a, ib,c,d 9 
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Fig. 4.1 Representative Feynman diagrams for four categories of LO 
reactions entering the calculation of two-jet production at hadron colliders. 
These are: qq — qq’ (top left), qg — qg (top right), qg gg (bottom left), 
and gg > gg (bottom right). 


where the sum runs over all permissible combinations of quarks, anti-quarks, and glu- 
ons. Four such parton-level processes that must be included at leading order in the 
strong coupling, which shall be discussed in more detail shortly, are shown in Fig. 4.1. 
These interactions should produce strongly interacting particles in opposite hemi- 
spheres, with zero net transverse momentum. The rate for this process is extremely 
high at the LHC, with cross-sections for typical jet cuts, pr(jet) > 50 GeV, in the 
tens of microbarn range, cf. Fig. 4.2. The cross-section for two jets with transverse 
momenta of 1 TeV or higher is still at the level of a nanobarn. Such energetic jets 
are of course much easier to produce at a possible future hadron collider: 1 TeV jets 
at a 100 TeV collider are just as common as 100 GeV jets at a 14 TeV LHC, with a 
cross-section of a few microbarns. 

This process represents an important probe of QCD in a number of ways. For 
instance, by measuring cross-sections as a function of jet transverse momenta it is 
possible to assess the running of the strong coupling. The large cross-section enables 
measurements of this process to be made for a wide range of transverse momenta and 
rapidities that, due to the very simple kinematics involved, can easily be translated 
into information about the PDFs (as will be discussed further in Chapter 6). Besides 
these areas where the two-jet process itself is the process of interest, jet production 
can easily be a source of background events for many analyses, for instance when one 
of the jets is misidentified as a lepton or a photon, or when mis-measurement of one 
of the jets results in substantial missing transverse momentum. Since the cross-section 
is so large, even a very small fake rate can lead to significant background rates. It is 
therefore essential to understand this process in some detail. 
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Fig. 4.2 Dijet production cross-sections at proton—proton colliders as a 
function of centre-of-mass operating energy, vs. Three values of the min- 
imum jet transverse momentum are considered: 50 GeV (solid), 100 GeV 
(dashed), and 1 TeV (dotted). 


4.1.1 Dijet production 


In the simple picture outlined so far the theoretical definition of the dijet process is 
clear, a 2 + 2 scattering involving only quarks and gluons. For the sake of illustration 
consider the four sub-processes, the reactions qq’ —> qq’, qg > 4g , 4q —> gg, and 
gg — gg that are depicted in Fig. 4.1. The matrix elements for these processes, summed 
over final-state colour and spins and averaged over the colour and spins in the initial 
state, are, 


g 4V2 Mgg—=>gg — V 


|\Mag >a -= aP toe Solas = = (55) ; (4.2) 
Mize) = ag ME? sgg = = (= a) (8? + a?) , (4.3) 
Misael = aw tie a ow (+ ~~) (E +02), (4.4) 
IMgg—99 Er soba = 2i (3 ut = =) i (4.5) 
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which also implicitly defines the non-averaged leading-order matrix elements m(). The 
matrix elements are written in terms of the usual invariant quantities § = (pı + p2)?, 
f = (pı — p3)? and û = (p2 — p3)?. Note that, as is clear from Fig. 4.1, the diagrams 
entering the calculations of the reactions qg —> qg and qq — gg are identical. The 
processes differ only by the identities of the partons that are present in the initial state. 
Therefore the matrix elements are expected to be related by a crossing symmetry, 
in this case the exchange of pg and —p3, where the minus sign accounts for the fact 
that the particles are exchanged between the initial and final states. By noting that 
this corresponds to the exchange § + f, the crossing relation can be straightforwardly 
verified by inspecting Eq. (4.5), up to an overall factor related to colour averaging in 
the initial state and a minus sign from crossing. 

To turn these matrix elements into a cross-section they must be combined with 
the appropriate two-particle phase space. As shown explicitly in Appendix A.3, the 
phase space for two massless particles can be written directly in terms of the transverse 
momentum and rapidity of one of them as 


_ pidpidnde 


d®2 2(27)3 


(27) 6 ((p1 + p2 — ps)*) (4.6) 


where it is convenient to work in the lab frame, in which the four-momentum of the 
first jet can be written as, 


p3 = pı (cosh n, sin ¢, cos ¢, sinh 7) (4.7) 


(see Appendix A.3). After performing the trivial integration over ¢ and using the p1 
integration to remove the 6 function, the phase space takes the simple form, 


2 
db, = —Pt 


i 
— FL dn, 4.8 
me (4.8) 


U> 


This is the phase-space element for a jet of a given transverse momentum (p1), with 
the remaining degree of freedom parameterized by its rapidity (7). An even more useful 
variable is one that reflects more directly the kinematic configuration of both jets. To 
that end it is convenient to write the momentum of the other jet as 


pa = p1 (cosh 7, — sin ġ, — cos ¢, sinh 7’) (4.9) 


so that momentum conservation (pı - p2 = p3- p4) then implies the relation 


— 1 
V8 = 2p. cosh (2 7 x) = PL (=) ; (4.10) 


where x = exp(73 — 74). The variable x is clearly a natural one to describe the 
problem and, since it only depends on the difference of two rapidities, it is invariant 
under longitudinal boosts. This property means that it has a very simple definition 
in terms of the angle 6 between the jet and the beam in the centre-of-mass frame. To 
see this relation it is useful to write expressions for the invariants Ê and å in the two 
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frames, 


(1 + cos) = ae (4.11) 


From these it is clear that x and @ are related by 


E 1 + cos 0 


— eee 4.12 
1—cos@ ( ) 


X 
Performing the change of variables dy + dy/x in Eq. (4.8) yields a particularly simple 
parameterization of the phase space, 


1 dy 


d = — ~. 
>= Ge G+ DP 


(4.13) 


Finally, combining the final-state phase space in Eq. (4.13) with the matrix ele- 
ments of Eq. (4.5) recast in terms of x, yields the relevant quantities for each partonic 


channel, 
1 V x i 
2 
də |Maz >ar = = sn ox 14 (5) ; (4.14) 
2 1 1 
d®2 |Mag>ag| = Tn aN2 ox (4.15) 
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1 V dy 1 1+x? 
dë as SA A 2N? 4.1 
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1 2N? (l+x+ x7)? 
an V X FI 


de, Massaal" = (4.17) 


With the exception of the gg — gg reaction, these expressions depend rather weakly 
on x in the physical region, x > 1. As a result, a measurement of do/dyx for jet 
production is quite insensitive to the details of the parton distribution functions that 
have yet to be folded in. Indeed, this weak dependence on y is none other than the 
statement that these processes are dominated by the t-channel exchange of a spin-1 
gluon, analogous to Rutherford scattering. 

Indeed, this observable can be used to search for evidence of contact interactions 
that would indicate quark substructure. For instance, in the presence of an additional 
4-quark contact term [394, 718], 


contact = a (PLY br) (br ype), (4.18) 
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Fig. 4.3 The quantity 4r d®2|M a-—scal? /dx as a function of x. This quan- 
tity is plotted for gg —> qg (upper dotted), gq —> qq (dashed), qq > gg 
(lower dotted), gg — gg (dot-dashed), and gg — q@ in the presence of a 
contact interaction with § = 0.5 TeV, A = 1 TeV, and a; = 0.1 (solid). 


the expression for the quark—anti-quark contribution to the cross-section is modified 


to, 
a (4) + (aa) <r} ne 


An illustration of the behaviour of these various contributions is shown in Fig. 4.3. As 
observed previously, the curves for the quark—gluon and quark—anti-quark initial states 
depend only very mildly on x. The reaction gg — gg shows a rapid rise as x + 1, but 
the curve is rather flat for x > 2. In contrast, in the presence of a contact interaction 
with § = 0.5 TeV, A = 1 TeV, and a, = 0.1 there is a marked rise for moderate x, 
2S x <6. In reality this simple picture is distorted somewhat once the convolution 
with the parton distribution functions has been performed and all partonic channels 
summed over, but the essential difference remains. The issue of contact interactions 
will be revisited, in the context of experimental searches at the TEVATRON and the 
LHC, in Chapters 8 and 9. 

A further simple remark concerning the kinematics of the dijet process is useful 
in order to demonstrate its importance in determinations of the PDFs. For a collision 
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involving two partons with momentum fractions x; and x2 of the parent hadrons, 
momentum conservation yields the relations, 


VE ay +22) = pı (cosh n + cosh 7’), (4.20) 
VE ay — x2) = pı (sinh ņ + sinh7/’). (4.21) 

These equations are easily solved for x; and 29, 
Ly = a (e + e) ; LQ = z Ea + gt) ; (4.22) 


Hence different ranges of transverse momenta and rapidities represent probes of par- 
ticular regions of xı and x2. In order to probe large momentum fractions one can 
investigate the behaviour of jets with large pr or ņ. Since a jet has a certain minimum 
pr, in order to access small momentum fractions it is most useful to examine jets 
of moderate transverse momenta but that lie at large rapidities. These features are 
exploited in global fits of PDFs to hadron collider jet data, as will be discussed further 
in Chapter 6. 


4.1.2 Dijets at next-to-leading order 


The next-to-leading-order corrections to dijet production have been known for a long 
time [509, 574]. The virtual corrections are relatively simple due to the fact that it is 
a 2 — 2 process involving only massless particles, so that the scalar integrals can be 
written in terms of only logarithms and constants. 

The perturbative expansion for the partonic process a + b 4 c + d, without the 
average over initial state colours and spins, reads, 


a ; 
Marred = 9) 04+ (52) mato (4.23) 


where the virtual matrix elements mË? a represent the interference of tree-level and 
loop diagrams. For the process qq’ — qq’ the virtual contribution is given by [503], 
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This equation is written in terms of the colour factor V = N2 — 1 and the logarithmic 
function, 


I(x) = log (-=) (4.25) 


where Q? > 0 is an arbitrary momentum scale. Note that this means the function 
develops an imaginary part for x > 0. Since only the real part should be kept in 
Eq. (4.24), this only results in contributions from terms of the form, 


P(t) = log? ( =) > log? (=) r’ ift >0. (4.26) 


The result in Eq. (4.24) also demonstrates that, unlike the simplest Drell-Yan case 
considered in Chapter 3, in general the virtual matrix element exhibits a rich kinematic 
structure. Only the singular terms are proportional to the lowest-order matrix element, 
while the finite remainder depends on s, t, and u in a more complicated way. 

The other partonic contributions can be written in a similar fashion. The virtual 
contribution to the process gg > gg is, 
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which makes explicit the form of the singular terms. The auxiliary function fı contains 
only finite contributions and is defined by 
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In this case it is clear that the singular contributions are not all proportional to the 


leading-order matrix element, mÉ 


OI a Only the soft divergence, represented by the 


1/2? term, multiplies this factor. The collinear divergence, given by terms proportional 
to 1/e, is more complicated due to the fact that the collinear factorization of the matrix 
elements is modified by colour connections between the gluons. This is a general feature 


of calculations for more complex final states. 
Finally, the result for the all-gluon process gg —> gg is 
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where fə is defined by 
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In all of these virtual matrix elements one can also see the emergence of the loga- 
rithms of the renormalization scale, u, that were previously highlighted in Eq. (2.123). 
Inspecting Eqs. (4.24), (4.27), and (4.30) one can see that they all contain a term 
proportional to the appropriate leading-order matrix element multiplied by the factor 


IN. 4 
(ABE - Sate) Un?) = ot), (4.31) 
There is a term of precisely this form, proportional to the beta-function coefficient 6o, 
in Eq. (2.123). 


4.1.3 Scale uncertainties in inclusive jet production 


The dijet process is sufficiently simple that it provides a good laboratory in which to 
perform a more thorough examination of the dependence of the theoretical prediction 
on the renormalization and factorization scales, wr and upr. To do so it is useful to 
consider inclusive jet production, where the notion of an inclusive cross-section 
refers to the idea that it describes the observed properties of any one of the jets in a 
given event. This means that, for a n-jet event, there are n contributions to an inclusive 
jet observable. A natural choice for the hard scale in the theoretical calculation of 
inclusive jet quantities is the transverse momentum of the jet under consideration, pip. 
Therefore a scale choice u = py, means that an event containing n jets requires an 
evaluation of the corresponding matrix elements and PDFs n times, each time with a 
scale proportional to the pr of the jet under consideration. An alternative scale choice 
is given by the transverse momentum of the lead jet in the event (p? ), although this 
introduces a new scale that is not a natural one for an inclusive observable. At the 
Born level the two choice are identical but there are non-trivial differences at higher 
orders which will be discussed later. 

Rather than choosing a common value for ur and up, as in Section 2.2.6, it is 
instead instructive to allow them to vary independently so that the theoretical pre- 
dictions can be pictured in a two-dimensional surface spanned by ur and up. An 
example of such a surface is shown in Fig. 4.4, for the case of the NLO prediction for 
the inclusive jet cross-section at ys = 7 TeV.! Since the theoretical prediction con- 


1We thank Pavel Starovoitov for providing the ROOT ntuples. 
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Fig. 4.4 The scale dependence for the inclusive jet cross-section using 
the anti-k, jet algorithm with R = 0.4 at ys = 7 TeV. Jets satisfy 
60 < pr < 80 GeV and lie in the rapidity range 0 < y < 0.3. Cross-sections 
have been computed with NLOJET++ [777] interfaced with Applgrid [329] 
and are normalized to the prediction at scales of ur = ur = 2.5pr. 


tains logarithms of the form log(ur/pr) it is prudent to vary them in a similar range, 
so that the surface extends as far as p/,./5 < ur, ur < 5p. The two-dimensional 
equivalent of the NLO scale dependence observed, for instance in Fig. 2.22, is a saddle 
shape. In this particular example the peak cross-section is at the saddle point, which 
corresponds to a scale of approximately wp = HF = ph. 

A simpler way to see the saddle point, and the region of mild dependence on the 
scales around it, is to examine a contour plot such as the one shown in Fig. 4.5 (left). 
From this plot it is clear that the cross-section depends much less on the factorization 
scale than on the renormalization scale. For a much higher jet transverse momentum 
range the corresponding plot is qualitatively different, cf. Fig. 4.5 (right). The plot 
appears to have been rotated by an angle of —45° with respect to the vertical, such 
that the saddle point is at somewhat smaller scales and a scale choice of pj, no longer 
corresponds to the peak cross-section. 

This behaviour can be understood as follows. This process is dominated by jets 
produced in the central rapidity region, thus probing partonic momentum fractions x ~ 
2p?,/(7000 GeV), cf. Eq. (4.22). Hence, for low jet transverse momenta x ~ 0.02 and the 
dominant sub-process is gg > gg. At this x value, there is very little up-dependence 
in the gluon distribution, as will be discussed in Chapter 6. This is the behaviour 
observed in Fig. 4.5 (left), where the jet cross-section is relatively independent of the 
factorization scale. In contrast, the higher pr region shown in Fig. 4.5 (right) probes 
much larger parton x values where sub-processes such as gq —> gq and qq — qq are 
also important. In this region both the quark and gluon distributions depend more 
strongly on ur, leading to the “rotation” noted above. 

The same analysis can be performed in different kinematic ranges, for instance other 
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Fig. 4.6 Scale-dependence contours for the NLO inclusive jet cross-section 
using the anti-k, jet algorithm with R = 0.4 (left) and R = 0.6 (right). Jets 
satisfy 1200 < pr < 1500 GeV and lie in the rapidity range 1.2 < y < 2.1. 
Cross-sections have been normalized to the prediction at ur = uF = 2.5pr. 


regions of jet rapidity and jet separation. Fig. 4.6 (left) shows the contours of scale 
dependence in the same high transverse momentum range but now also corresponding 
to much larger rapidities. Again, the rotation has taken place but the saddle point is 
once more at scales near ur = Ur = pr. Fig. 4.6 (right), depicts the scale-dependence 
contours when the jet size is increased from 0.4 to 0.6. The peak cross-section, the 
saddle point, corresponds to smaller values for both scales as the jet radius increases. 

Note that the standard scale uncertainty analysis corresponds to a one-dimensional 
projection of the contour plot along the diagonal ur = ur. In all these cases this would 
result in a curve with a maximum at or near the saddle point. However, as demon- 
strated above, the exact form of the scale dependence clearly depends on the kinematic 
region being studied and such a similarity between the one- and two-dimensional anal- 
yses is not guaranteed. Such considerations need to be kept in mind, not only for inclu- 
sive jet production, but more generally for all theoretical predictions for cross-sections 
at the LHC. 
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Fig. 4.7 Scale dependence of the inclusive jet cross-section at the 7 TeV 
LHC, computed at LO, NLO, and NNLO, for the central scale choices 
ur = ur = p} (left) and ur = pr = p} (right). The calculation uses 
the anti-kr algorithm with R = 0.4 and jets are defined by |y| < 0.5 and 
100 < pr < 116 GeV [422]. Reprinted with permission from the authors. 


This discussion should of course be extended to even higher orders, when such the- 
oretical predictions are available. Very recently, pioneering calculations have extended 
the accuracy of inclusive jet production cross-sections to NNLO [335, 422, 566], as 
shown in Fig. 4.7. The inclusive jet cross-section is shown for a particular slice of 
jet transverse momenta at the 7 TeV LHC, with the central scale given by either p% 
or the pr of the leading jet, př. The behaviour when moving from LO to NLO to 
NNLO clearly depends quite strongly on the choice of central scale. For instance, the 
improvement in the scale dependence at each order that is observed for u = p% is not 
observed for u = ph; for a detailed discussion of this and other related subtleties, see 
Ref. [422]. 


4.1.4 Jet shape 


At NLO there can be two partons in a jet and a description of the jet shape — the 
distribution of energy inside the jet — is possible. Note that by definition, a NLO 
calculation of the inclusive jet cross-section is a LO calculation of the jet shape. Soon 
after the inclusive jet cross-section was first calculated to NLO, comparisons of the 
NLO predictions and the experimental data from the TEVATRON were carried out [510]. 
Perhaps surprisingly, the description of the jet shape using only one parton agreed well 
with the experimental data, as shown in Fig. 4.8. Two caveats apply though: first, the 
non-perturbative effects were ignored in this comparison (expected to be small for a jet 
transverse energy of 100 GeV), and second, the jet shape at large distances from the 
jet centre did not agree with the fixed-order prediction unless an ad hoc parameter 
(Rsep) was introduced. Two partons would normally be combined within the same 
cone jet if they were separated by a distance of less than 2Reone. Agreement with 
the data only occurred if the two partons in the NLO calculation were required to 
be within a distance 2Reone X Rsep, where Rsep had to have a value around 1.3. This 
was later explained (see for example Ref. [508]) by the stochastic instabilities of jet 
clustering using the multiple hadrons comprising a jet, as compared to the original two 
(fixed order) partons. These are complications due to the use of cone-jet algorithms 


Production of jets 195 


mas: u=ET, Rsep=2R 

are H=E7/2, Rsep=2R 
u=Er/ 4, Rsep=2R 
u=Er/4, Rsep=1 3R 


0.004 15 2 25 3 4 5 6 78 100 


Fig. 4.8 The fraction of energy for a jet of radius R = 1.0 that is inside a 
radius r. Data from the CDF experiment for jets of 100 GeV is compared to 
fixed order theoretical predictions at NLO. The curves correspond to dif- 
ferent values of the renormalization and factorization scales and/or values 
of the parameter Rsep. Reprinted with permission from Ref. [510]. 


and are absent for example for the anti-ky jet algorithm. The main point is that jet 
shapes can be reasonably well described using only the one extra gluon present in a 
NLO calculation. 

Jet shapes will be revisited in the context of experimental data from the TEVATRON 
in Section 8.3. With the increasing precision of parton shower Monte Carlos, it has 
become much more common to compare experimental jet shape measurements with 
the predictions of those Monte Carlo programs (where the jet shape is described by 
the effects of multiple gluon emissions, as well as non-perturbative effects). 


4.1.5 Multijets 


Final states containing more than two jets are also of considerable interest at hadron 
colliders. Foremost, their study constitutes an essential test of the theory of strong 
interactions and our ability to describe jets of hadrons using the partonic description. 
In addition, the production of multijets is a considerable background in many searches 
for new physics, typically when one or more of the jets fakes either a lepton or a 
photon. 

From a theoretical point of view the description of multijet states becomes sig- 
nificantly more complicated than the dijet case. The number of Feynman diagrams 
contributing to a given n-jet calculation increases more than factorially with n. In 
addition, as n grows so does the complexity of the colour configurations that must be 
included in the calculation of the scattering amplitudes. The use of recursion relations, 
as described in Section 3.2.1, has been instrumental in taming the factorial growth and 
enabling the calculation of leading-order predictions for n > 4 [220]. These recursive 
techniques, coupled with the D-dimensional numerical unitarity methods mentioned 
earlier, have led to algorithms capable of computing one-loop virtual corrections for 
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Fig. 4.9 NLO predictions for multijet cross-sections at the 7 TeV 
LHC [183], compared to ATLAS data from Ref. [4]. 


essentially arbitrary multijet processes [181, 575]. In view of the factorial growth in 
the number of diagrams it is especially interesting to observe the scaling behaviour of 
the computational time using such techniques. The NJET code [181] scales only as a 
power of n, the number of external partons in the scattering amplitude. Configurations 
that involve only gluons are most time-consuming to evaluate and scale as approxi- 
mately n°. As pairs of gluons are replaced by a quark—anti-quark pair the computation 
becomes less complicated and thus faster, although the scaling behaviour is harsher. 
Ref. [181] indicates that a single such computation for n = 13, corresponding to 11-jet 
production, may be completed in less than a second. This is a testament to the power 
of modern numerical unitarity approaches. 

The combination of these ingredients into full NLO calculations of multijet pro- 
duction is an arduous task, due to the high number of partonic channels and their 
associated infrared singularities. The three-jet case was first available in the NLOJET 
code [777] in 1997, while the four-jet calculation has only more recently been completed 
by two independent groups [180, 231]. Results for up to five jets have been presented 
in Ref. [183], allowing for a comprehensive study of multijet production over a large 
kinematic range. Fig. 4.9 shows the NLO predictions for the n-jet cross-sections for 
n = 2,3,4,5 at the ys = 7 TeV LHC, computed using NJET virtual matrix elements 
and SHERPA for the assembly into a full NLO prediction. The predictions use a central 
scale of Hr /2, which is found to be a scale at which the NLO corrections are small and 
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the scale dependence flat. For n = 3,4,5 jets the agreement between the theoretical 
prediction and the data is excellent, with differences at the level of 30%. Given the 
complexity of the QCD interactions being described, this agreement is remarkable. 
Moreover, as can be seen in the figure, the difficulty of experimentally measuring such 
cross-sections means that the theoretical uncertainties for n > 3 are smaller than the 
experimental ones. The situation is different in the two-jet bin, where the perturbative 
expansion is not under such good control. There the effect of the NLO corrections is 
very large under this set of cuts and the NNLO corrections should be important. Note 
that the cross-section falls by almost the same factor, about ten, when another jet is 
included in the final state. This is an example of Berends scaling, that was originally 
observed in vector boson+jet production [224]. This observation motivates the study 
of the quantity, 

a(n +1 jet) 


ae a(n jet) 


(4.32) 
which is especially interesting since various sources of theoretical and experimental 
uncertainty may cancel in such a ratio. Since Rn is proportional to as at first order, 
this allows determinations of the strong coupling from measurements of jet rates at 
hadron colliders, in a similar manner to previous determinations at LEP [174]. 

Although no results have been presented for more than five jets, it is clear from 
the discussion above that the virtual matrix elements are readily available in the NJET 
code and could be utilized in the future. However, due to the time taken to evaluate 
both the virtual matrix elements and the real corrections, and also because of its 
limited phenomenological interest, NLO predictions for more than five jets may not 
be available for some time. 


4.2 Production of photons and jets 


The production of photons, either on their own or accompanied by additional jets, 
plays an important role at hadron colliders. Since the photon—quark and gluon—quark 
interactions are very similar the Feynman diagrams representing these processes are 
cousins, despite the lack of self-interactions for photons. Of course, photons are pro- 
duced far less copiously than jets due to the difference in strength between the elec- 
tromagnetic and strong forces. However, since a photon is typically a well-measured 
object, these processes can provide useful constraints on hadronic quantities. These 
processes also provide significant backgrounds to the detection of the Higgs boson 
through its decay H > yy. 


4.2.1 Theoretical considerations 


At the most basic theoretical level the description of final states containing photons 
and jets is no more complicated than that of jets alone. To see how to make a direct 
correspondence, consider the process involving a quark-antiquark pair and two gluons, 


0 G7 (p1) + (p2) + 97 (p3) + g% (pa). (4.33) 


where each particle is in a definite (postive or negative) helicity state as indicated by 
the superscript. The leading order amplitude for this process can be written as, 
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where the amplitude has been expressed in terms of subamplitudes that are propor- 
tional to products of colour matrices in two different orders. This decomposition is 
made possible by the relationship between the QCD structure constants that appear 
in the diagram containing the triple-gluon vertex and the anti-commutator of two 
colour matrices, 


ag a — ; fazasbrmb 
[a T] a TISSET ia (4.35) 


4122 
The subamplitudes that appear in Eq. (4.34) are simple examples of colour-ordered 
amplitudes. 

The helicity choice under consideration is a maximal helicity violating (MHV) 
one, with two partons of a given helicity and the remainder of the opposite helicity. 
As a result the corresponding colour-ordered helicity amplitudes are given by simple 
expressions, 


3 
on ae Oe (13) (23)3 
M(G 43,94 593) = RESOLVER (4.37) 


In these two amplitudes the colour-ordering of the gluons is apparent from the de- 
nominators: (23)(41) and (2 4)(3 1) respectively. In addition, the denominator factors 
(12) and (34) are signatures of the presence of the diagram containing a triple-gluon 
vertex. 
The corresponding amplitude for the process in which one of the gluons is replaced 
by a photon, 
0 => g (p1) +a" (p2) + ¥ (p3) + g* (pa). (4.38) 


receives contributions from similar diagrams, and apart from the overall coupling factor 
that is different, the coupling of a photon to quarks does not introduce a colour matrix. 
Therefore the factor T°: in Eq. (4.34) can be replaced by an identity matrix in colour 
space and the overall coupling changed to yield, 


M(G 92.73 94) = ieQag (T™);, 5, [MG a 93 94) + MG, 92 94 93 )] 
= ieQqg (T) iz MG, G2» V3 94)» (4.39) 


where @, is the electric charge of the quark, in units of e. The photon amplitude is 
thus obtained by a sum over both colour orderings of the gluon amplitude. One might 
worry about the presence of the triple-gluon diagram in the original result, which 
should not be present for the photon. However, since this diagram enters each colour 
subamplitude with opposite sign, cf. Eq. (4.35), the sum in Eq. (4.39) ensures that 
this diagram does not contribute to the photon amplitude. Explicitly, the result is, 
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Fig. 4.10 Example Feynman diagrams for direct photon production, 
qg > qy (left) and gg > gy (right). The complete set of Born diagrams 
is obtained by considering also the diagrams with the opposite ordering of 
the gluon and photon on the quark line. 
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where in the final line the denominators (12) and (34) cancel after using the Schouten 
identity (cf. Section 3.2.1) to simplify the numerator, as expected. It is also clear from 
Eq. (4.39) that the two-photon amplitude with the replacement gt(p4) —> y7 (p4), 
must be identical up to an overall coupling and colour factor. 

With such rules it is straightforward to compute amplitudes for processes involv- 
ing photons from results obtained in pure QCD [455]. An amplitude for a process 
containing a photon can be simply obtained from a colour-ordered gluon amplitude 
by appropriate symmetrization. Thus, once care has been taken in order to define the 
photon appropriately according to the requirements in Section 2.1.6, the calculation of 
an m photon plus n-jet final state is straightforward once the (m + n)-jet calculation 
is at hand. 


4.2.2 Direct photon production 


The simplest process to consider at a hadron collider is direct photon production. It is 
represented at leading order by Feynman diagrams such as the ones shown in Fig. 4.10, 
for which one of the helicity amplitudes has already been given in Eq. (4.40). At leading 
order the kinematics of this process are rather simple, with the parton transverse 
momentum equal to that of the photon. This property means that a measurement 
of final states consisting of a photon and a single jet is of particular significance in 
assessing detector performance. The well-measured photon can be used to calibrate 
the response of the hadronic calorimeters, providing a measurement of the jet energy 
scale and its uncertainty. 

Since the hadronic and photonic activity lie in different hemispheres at LO, the 
issue of photon isolation does not enter the perturbative calculation at this order. At 
NLO this is no longer the case, with real radiation contributions allowing partons to 
populate areas of phase space close to the photon. Moreover, diagrams such as the one 
shown in Fig. 4.11 (left) can result in a strong dependence on the manner in which 
the photon is isolated since it contains a singularity when the quark and photon are 
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Fig. 4.11 A Feynman diagram entering the NLO calculation of direct 
photon production (left) and one representative of the fragmentation con- 
tribution (right). The diagram on the left is drawn to emphasize the singu- 
larity when the photon and quark are collinear. On the right, the function 
Dy, — y represents the fragmentation of a quark into a photon. 


collinear. This singularity can be absorbed into the definition of the bare function 
representing a quark fragmenting into a photon. The remaining finite fragmentation 
function D;_,, is thus dependent on the physical scale at which this separation is 
performed, the fragmentation scale Mp. The inclusion of such fragmentation con- 
tributions, dependent on Dy-.,(Mp), is indicated in Fig. 4.11 (right). Just like the 
parton distribution functions, the fragmentation functions are non-perturbative quan- 
tities that must be experimentally determined but whose evolution is governed by 
perturbative QCD. 

Schematically, a differential cross-section for photon production can thus be written 
as the sum of two components, 


do = do14x (Mr) + S dox ® D,_,y(Mr). (4.41) 


t 


The direct, or prompt component, is represented by the first term and the fragmenta- 
tion contribution by the second. The sum runs over all partons, with o;4x the inclusive 
differential cross-section for the production of parton i. As is clear from the equation, 
this separation is well-defined only at a given value of the fragmentation scale Mp 
(and is also scheme dependent). Note that in the case of the Frixione isolation crite- 
rion discussed in Section 2.1.6, the isolation constraint removes the collinear singularity 
present in Fig. 4.11 (left). As a result there is no need to introduce the concept of a 
fragmentation function in this approach. From a theoretical standpoint this is very 
attractive since the calculation becomes more straightforward and amenable to the 
usual techniques of pure QCD. 

The direct photon process can also be used as an effective probe of the parton 
distribution functions, in particular of the gluon distribution. Indeed, until 1999 direct 
photon data were routinely used in the global extraction of PDFs. Since DIS measure- 
ments are not directly sensitive to the gluon PDF this was a useful complement to 
the wealth of available HERA data. However, the inability of NLO predictions of the 
time to accommodate data from some fixed target and collider experiments led to the 
abandonment of this approach. TEVATRON collider photon data at moderate to high 
pr do have the potential to provide information on the PDFs, but the dominant sub- 
process at a pp collider of this energy is gg —> yg. Since the quark distributions are well 
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Fig. 4.12 Relative contributions of the Compton, annihilation, and frag- 
mentation contributions at the 14 TeV LHC, for photons at y = 0 and as 
a function of their transverse momentum. The figures are produced with 
the NLO program JETPHOX. Reprinted with permission from Ref. [462]. 


constrained in this region already, little useful additional information is provided. This 
is not true at the LHC, where the availability of a wealth of data has led to renewed 
interest in constraining the gluon PDF with photon measurements [462]. Fig. 4.12, 
taken from this reference, shows the size of various contributions to the direct photon 
cross-section at the LHC, at central rapidity (y = 0) and as a function of the photon 
transverse momentum. As expected, the “annihilation” contribution — resulting from 
diagrams such as Fig. 4.10 (right) and its corresponding NLO corrections — is much 
smaller than the “Compton” process (Fig. 4.10 (left)). For inclusive direct photon 
production, Fig. 4.12 (left), the fragmentation component is large, even for very high 
Er photons. However after application of typical experimental isolation cuts the less 
well-known fragmentation contribution is much reduced, as shown in Fig. 4.12 (right). 
As a result the direct photon process can be used to probe the gluon PDF at the LHC, 
particularly now that the theoretical prediction is known to NNLO [316]. 


4.2.3 Photon pair production 


The production of photon pairs arises through the leading-order process, 
q+q- yy. (4.42) 


As an example of its importance, it provides the principal background to Higgs boson 
production in the decay mode H — yy. It should therefore be understood in some 
detail. Experimentally, this final-state signature also receives a significant contribution 
from both the direct photon and dijet processes, where one or more of the associated 
jets is identified as a photon. This section will focus on aspects of the perturbative 
calculation of the process in Eq. (4.42) and its higher-order corrections. 

To be clear about the role of different diphoton amplitudes in the higher-order 
corrections it is useful to make it explicit in the following way, which mirrors the 
discussion in Section 3.1. Consider the quantity Mp that represents the amplitude for 
the process qq > yy + n gluons. It may be expanded in perturbation theory as, 
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Table 4.1 The contribution of various diphoton amplitudes to relevant observables 
and the order at which the obervable is then predicted. 


Amplitude contribution Coupling Otot p?’ distribution Gjet gap 


Mo 1 LO : : 
Mo]? 9? NLO LO : 
2Re (MP ME") p NLO : ; 
MoO! g NNLO NLO LO 
2Re (MOMO gt NNLO NLO - 
2 Re ( MP ME gt NNLO : 
Mef gt NNLO > y 
Mn =9" [mo + gM +g MP +...), (4.43) 


where MẸ represents the corresponding l-loop amplitude. Thus Mo) simply rep- 


resents a tree-level calculation, with Mo just the diphoton production process in 
Eq. (4.42). It is then a straightforward exercise to list all the contributions that en- 
ter the calculation of the diphoton cross-section at a given order, remembering that 
real-emission contributions (i.e. n > 0 in Eq. (4.43)) must also contribute. An explicit 
list, up to NNLO, is given in the first column of Table 4.1. The corresponding in- 
gredients for two other observables, the transverse momentum of the diphoton system 
(p}") and the cross-section for diphoton production in association with two jets widely 
separated in rapidity (Tjet-gap); are also shown. The table makes clear the order at 
which each observable can be calculated, in the sense of Section 3.1. For instance, a 
NNLO calculation of the total cross-section also contains a NLO calculation of p3’ 
and a LO one of Gjet-gap- However the converse is not true — a NLO calculation of 
p% is not equivalent to a NNLO calculation of oto, since, for example, it does not 


contain genuine two-loop corrections originating from Mo, 

The NLO QCD corrections to the total cross-section are included in the DIPHOX 
Monte Carlo [254]. This calculation includes fragmentation contributions and thus al- 
lows for a traditional implementation of isolation. The gluon PDF enhancement at 
small momentum fraction x leads to a large flux of gluons at high energy, in par- 
ticular at current LHC energies. Therefore, although diphoton production does not 
involve gluons at leading order, it can receive significantly corrections from such con- 
tributions at higher orders. Although formally suppressed by additional powers of 
the strong coupling, such contributions are numerically important because of the size 
of the gluon PDF. One class of diagrams that contributes to oto, at NNLO, those 
involving loops of quarks as shown in Fig. 4.13, can give a significant additional con- 
tribution [152, 463, 775]. As indicated in Table 4.1, such diagrams enter the calcula- 


2 
tion as jms? | . However, they are particularly interesting since their contribution is 


Production of photons and jets 203 


Y 


Fig. 4.13 Gluon-induced loop contributions to photon pair production. 


separately finite. This is guaranteed by the fact that there is no tree-level gg > yy 
amplitude. In fact this contribution has been computed to the next order in the strong 
coupling, including some of the NLO corrections to the total diphoton rate [230, 235]. 
Despite the apparent high order of the correction terms, since the O(a?) gg + yy pro- 
cess is finite the calculation of the strong corrections to it only requires the technical 
machinery used in a normal NLO computation. 

By now a number of full NNLO calculations of tot have been performed, albeit 
without accounting for the effects of photon fragmentation [313, 340]. As expected, 
the NNLO calculation becomes sensitive to kinematic regions that are inaccessible or 
poorly described by the NLO prediction. One such example is shown in Fig. 4.14, taken 
from Ref. [313], which illustrates the predictions for the azimuthal angle between the 
two photons obtained in the NLO and NNLO calculations of o4.¢. At leading order in 
the total cross-section the photons are produced back to back, so the prediction for 
this distribution is a 6 function at Ad, = a. At NLO this requirement is relaxed, 
but as a result the distribution for Ad, < m is predicted only at leading order. At 
NNLO in the total cross-section A@,, may only truly be predicted at NLO. In this 
way it is very similar to the diphoton transverse momentum (p7}") which is also only 
predicted at NLO in this calculation, cf. Table 4.1. It is also clear from Fig. 4.14 that 
the effect of the NNLO contribution results in a much-improved agreement with the 
data collected by CMS [378]. However, the theoretical prediction is far from perfect, 
with differences of up to 40%, as might have been expected from a prediction that is 
effectively only at the NLO level. Similar agreement has also been obtained in ATLAS, 
as discussed in Chapter 9. 

A limitation of the fixed-order approach can be made clear by considering the 
effect of typical experimental photon cuts. It is common in diphoton analyses to use 
staggered cuts, 

pr > 40 GeV, p? > 404.6 GeV, (4.44) 


where p} and p}? are the transverse momenta of the hardest and softest photon 
respectively and 6 is of order 10 GeV. Such cuts can be useful for reasons of purity of 
the signal, or rejection of fake backgrounds. In the perturbative calculation it raises a 
problem due to the fact that, at LO, the two photons have equal transverse momenta. 
At NLO this is no longer the case and indeed the NLO cross-section is quite sensitive 
to the value of 6, as shown in Fig. 4.15. For negative 6 the cross-section grows rapidly 
and, with a LO cross-section of 4.8 pb with these cuts, the NLO corrections become 
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Fig. 4.14 The azimuthal angle between the two photons, Ad, at NLO 
(lower histogram) and NNLO (upper histogram) [313]. The data points are 
as observed by the CMS collaboration [378]. The height of the histogram 
bins indicates the scale uncertainty. The lower panel shows the ratio of the 
data to the NNLO prediction. Reprinted with permission from Ref. [378]. 


very large. Moreover, the prediction exhibits an interesting cusp in the region of small 
ô. This is indicative of the presence of terms proportional to 6 log 6 resulting from the 
emission of soft gluons [545]. In order to provide a well-controlled prediction for the 
diphoton cross-section in this case, when the staggered cuts are very close together, 
some form of resummation of these logarithms should be performed, cf. Chapter 5. 
However, it is worth noting that away from the threshold region, for instance in the 
region of the Higgs signal where myy ~ 125 GeV, the reliability of the calculation is 
not spoiled by such logarithms. 

By now, NLO calculations of the diphoton process in association with up to three 
jets are available [185], using the Frixione isolation criterion. On the one hand these 
predictions can be used to provide further tests of QCD and the understanding of pho- 
ton production. For instance, just as in the case of pure-jet production (cf. Section 4.1), 
one can now construct NLO predictions for cross-section ratios that are sensitive to 
the strong coupling. In addition, such processes are important backgrounds for Higgs 
production, particularly in the weak boson fusion channel, when the Higgs boson sub- 
sequently decays to photons. 
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Fig. 4.15 The effect of staggered transverse momentum cuts in a NLO 
calculation of diphoton production. 


4.3 Production of V+jets 


The production of vector bosons in association with jets is a benchmark process in the 
SM for a number of reasons. As already discussed, it lends itself to a good description 
in perturbation theory because of the intrinsic hard scale given by the vector boson 
mass. Moreover it probes a wide range of parton luminosities and final state kinematics 
and it is thus able to test the perturbative description of collider processes in a very 
broad way. In addition, calculations of vector boson plus jets production have been 
the test-bed for providing systematic improvements to parton shower predictions. Al- 
though beyond the scope of this chapter, Chapter 5 will discuss in some detail various 
schemes for matching the approximate treatment of parton branching in shower Monte 
Carlos to exact results for one or more emissions at LO or NLO. Although more widely 
applicable, most of these methods were developed in the framework of V+jet produc- 
tion. The fact that both multijet parton-level predictions and experimental data are 
widely available has been crucial to the development and testing of such ideas. 

On the experimental side the motivation for studying such processes is also clear. 
The decays of the W and Z bosons can lead to final states containing charged leptons 
and/or missing transverse energy, which are then observed in association with jets. Un- 
derstanding such final states is crucial to many searches for new physics. For example, 
the production of a Z boson that decays to neutrinos, in association with jets, provides 
the dominant background to missing Er (MET) plus jets searches for dark matter and 
supersymmetry. In addition, such final states represent considerable backgrounds to 
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Fig. 4.16 LO cross-sections for W+, W~, and Z production at proton- 
proton colliders as a function of centre-of-mass operating energy, ys. For 
production of a vector boson in association with jets, the jets satisfy 
pr > 40 GeV, |y| < 5 and are found using the kr algorithm with D = 0.4. 


known SM processes; for example, the production of top quarks and the investigation 
of the newly discovered Higgs boson. Probing the properties of these known particles 
requires a good understanding of these high-rate backgrounds. The fact that the rates 
for the production of a vector boson in association with many jets are often significant 
in many LHC analyses can be appreciated by inspecting Fig. 4.16. At current LHC 
operating energies the cross-section for producing a vector boson in association with a 
single 40 GeV jet is a few picobarns. For each additional jet the cross-section falls by 
a factor of about three or four, a further example of the approximate Berends scaling 
discussed in Section 4.1. 


4.3.1 Tree-level and one-loop amplitudes for V + 1 jet 


For the production of W and Z bosons, with couplings to leptons and quarks that 
depend on their helicities, it is most useful to express partonic matrix elements in 
terms of helicity amplitudes. The helicity amplitudes can be easily dressed with the 
appropriate couplings as necessary. 

For V+1 jet production this is a particularly compact way of writing the matrix 
elements since there is only one independent helicity configuration for the leading- 
order and virtual amplitudes. These amplitudes were first computed in Ref. [576] and, 
using the notation of this book, in Ref. [234]. To establish the notation, the process 
under consideration is 
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0 — qt (p1) + 9* (p2) + 7 (p3) + & (pa) + £ (ps), (4.45) 


where all particles are outgoing and their helicities are indicated by the superscripts. 
Following Ref. [234], the leading-order amplitude for this helicity configuration is given 
by 

AU = 2e? grt Ate (4.46) 


1123 
where ag, 21, and ig are the colour indices of the gluon, quark, and anti-quark respec- 
tively, and the basic function is 


2 
Ate = —i a2) X (4.47) 
(12) (23) (45) 
The one-loop amplitude contains an additional decomposition according to the colour 
prefactor that appears. The decomposition is into leading-colour (lc) and sub-leading- 
colour (slc) amplitudes, where the designation corresponds to an expansion in (the 
inverse power of) the number of colours (Ne), 


1—loop __ 2. As Ne a lc 1 slc 


The leading-colour amplitude is given by 
€ 
E | 
— $23 — $23 


2 
Ale = crates | 1 ( H ) 
E — s12 
ka (3 4)* 379 28 
i G5 (03) ay 1 (SR Se 
—$23 


(3 4) (13) [15] Lo ( =) kX 3). [1 le (45) ne) (4.49) 


(12) (23) sa 2 (1.2) (23) sis 


which is written in terms of poles that are proportional to the tree-level amplitude and 
a finite remainder. The one-loop factor cr has previously been defined in Eq. (3.62). 
The remainder introduces a dependence on additional functions that are defined by 


_ ln(z) 5 Lo(x)+1 
S eS aa 
Ls_i(2,y) = Lio(1— x) + Lio(1—y) + lng ny— Z, (4.50) 


6 


where the dilogarithm function Lig(x) is defined in Eq. (A.9). The dilogarithm is 
ubiquitous in general one-loop amplitudes since it naturally appears in the analytic 
expression for scalar box integrals. Indeed, the function Ls_; is simply related to 
a scalar box integral evaluated in six space-time dimensions, which has neither an 
infrared nor an ultraviolet divergence. Note that the functions Lo(x) and L; (x) have the 
property that they are finite in the limit x — 1, which reflects a useful reorganization 
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of the amplitude into functions that are particularly numerically stable. 
The sub-leading-colour contribution can be expressed in terms of the same func- 
tions as, 


1 EN ag ENS 7 

sle _ _ tree H H 
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(12) (45) ae 
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TEN 012? (32) Lal Sess 
e tsa (=i. z) ee oe = 


23) [12] Lo(=) (31) [12] (42) [25] Li (=) 
(12)? (45) $23 (12) 813 
(31) (12) (24 aa Lo(=38) 1 p15) 4 BAN) 
. (4.51) 
(12)? (45) 513 2 [1.3] [32] (12) [45] 


After summing over all helicity combinations and crossing to account for each partonic 
configuration, these formulae are sufficient to construct the complete virtual matrix 
elements for a generic V + 1 jet process. 


4.3.2 Next-to-leading-order calculations 


Using numerical implementations of the types of on-shell unitarity methods discussed 
in Section 3.3.1, NLO predictions for V+jet production have been computed for up 
to five additional jets [232]. At present these calculations are now being limited not 
by the complexity of the one-loop calculations, but by the demands of the real radia- 
tion component. The sheer number of infrared-singular phase-space regions presents a 
serious challenge to the numerical stability and running-time of the code. With these 
calculations in hand it is instructive to examine the effect of these NLO corrections on 
the simplest possible observable, the V +n jet cross-sections with a given jet definition. 
As can be seen in Fig. 4.17, the NLO corrections reduce the scale dependence of the 
predictions considerably. For n > 2 the NLO calculation falls within the standard LO 
uncertainty bands, but for n = 1 this is not the case. In this case the NLO prediction 
is also a factor of two larger than at LO. This is explained by the fact that, as noted 
earlier, the V + 1 jet cross-section is atypical in that it does not depend on either the 
gg or qq parton luminosities, both of which are large at the LHC. These contributions 
only enter at NLO, in the form of real radiation corrections corresponding to diagrams 
such as the one in Fig. 2.26, and cause an unusually large NLO enhancement. 

The situation becomes more interesting when turning to less-inclusive observables. 
The jet transverse momentum distribution predicted for V+jet events can exhibit 
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Fig. 4.17 LO and NLO cross-sections for Wt+ jet production at the 
7 TeV Luc, for anti-kr jets with R = 0.5, adapted from Table I of 
Ref. [232]. The uncertainties are shown as blue (hatched) and red (solid) 
bands, for LO and NLO respectively. 


the type of “giant K factors”, corresponding to very large NLO corrections, already 
discussed in Section 2.2.7. Similar issues have also been observed for higher jet mul- 
tiplicities. In these cases the behaviour of the NLO calculation is perfectly normal 
in the regions that dominate the cross-section, that is the production of an on-shell 
vector boson with jets that are central and fairly close to the transverse momentum 
threshold. It is away from these regions, in more extreme kinematic configurations, 
that the bad behaviour is observed. When considering these regions it is clear that 
one should not expect the use of a fixed scale, for instance the mass of the vector 
boson, to produce reliable results. In the tail of a distribution, for instance at very 
high jet transverse momentum, that very scale provides an alternative physical choice 
that might be expected to provide a more sensible prediction. 

The choice of a scale that changes event by event rather than one that is fixed, for 
instance at the vector boson mass, must still be performed with care so as to capture 
the correct behaviour. Fig. 4.18, taken from Ref. [225], shows LO and NLO predictions 
for the transverse momentum of the second jet in W +3 jet events, computed at the LHC 
for two different event-by-event scale choices. When using the scale choice y = EW, 
the transverse energy of the W boson, the NLO prediction differs substantially from 
the LO one and, for sufficiently high transverse momentum, becomes negative and 
therefore unphysical. This is clearly a breakdown in the perturbative expansion due 
to a poor choice of scale. For jets at large transverse momenta EW is not the relevant 
physical scale. The dominant kinematic configuration is two jets that are produced 
approximately back to back, with the W boson and the third jet relatively soft. A 
sketch of such a configuration is shown in Fig. 4.19. In that case a scale such as 
b= Hr, the scalar sum of all partonic transverse energies, is more appropriate. Using 
such a scale restores the good behaviour of the NLO prediction, which now remains 
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Fig. 4.18 The Er distribution of the second jet in W~ + 3 jet events at 
the 14 TeV LHC, predicted at LO and NLO [225]. Predictions are shown 
for two central scale choices, u = EW (left) and u = Hr (right). Reprinted 
with permission from Ref. [225]. 
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Fig. 4.19 A W + 3 jet configuration for which FE would not 
be a good scale choice. As indicated by the length of the arrows, 
ED x EP > ER, EY. 


physical and rather close to the leading-order distribution, as can be seen in Fig. 4.18 
(right). 

As already discussed in Section 4.1, the approximate scaling of the V+jets cross- 
section with the number of jets was first observed in the original leading-order calcu- 
lations of up to V + 4 jets [224]. At that time such calculations were important for 
assessing the leading backgrounds to top pair production in the semi-leptonic decay 
mode. With the availability of NLO calculations for such quantities these observations 
can now be re-examined at higher order. The NLO predictions for the ratio 


a(W +n jets) 


~ o(W + (n — 1) jets) oe) 


Rn 
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Fig. 4.20 The dependence of the ratio Rn, as defined in Eq. (4.52), on 
the jet multiplicity n, for four different jet algorithms. The theoretical 
prediction is computed at NLO using the BLACKHAT code and is shown 
for jets with pr > 30 GeV at the 7 TeV LHC. 


are shown in Fig. 4.20, for n < 4. The ratio is shown for two different jet algorithms, 
anti-kr and SISCone, and two choices of the jet separation, AR = 0.4 and 0.7. Thus, 
for a variety of typical jet algorithms, the cross-section falls by approximately a factor 
of five for each additional jet present in the final state. The exact value of this ratio, 
and its approximation by a constant, is clearly dependent on the details of the cuts 
and algorithm. In particular it fails for larger jet separations where the phase space 
for multijet production becomes significantly constrained. Nevertheless, the idea that 
cross-sections for high jet multiplicities — beyond the scope of any present calculation 
— could be approximated in this way is very attractive. Such ideas have been the 
focus of renewed interest in recent years [570, 571]. 

Calculations to NNLO are now available for the W+jet [274] and Z+jet [269, 564] 
processes. In both cases the NNLO calculations indicate only a small correction at 
this order, which is especially reassuring given the large NLO K factor observed in 
Fig. 4.17. Most importantly, the theoretical scale uncertainty is reducted to the level 
of about 5%. These developments pave the way for precision studies using these final 
states in upcoming LHC runs. 


4.3.3 Jet algorithms and scale dependence 


These processes provide an ideal arena for investigating the effects of different jet al- 
gorithms and their interplay with, for instance, the issue of scale dependence. The 
dependence of multi-parton fixed-order jet cross-sections, at both LO and NLO, de- 
pend on these details in ways that are not always obvious. For instance, at leading 
order each parton is associated with a single jet and the imposition of a jet algorithm 
with size parameter R simply requires that all pairs of partons are separated by the 
distance R in (7,@) space. The larger the distance requirement, R, the smaller the 
cross-section — with a rather strong dependence due to the collinear singularities as- 
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sociated with parton splittings in the underlying matrix elements. At NLO there can 
be two partons in a jet, so that for the first time, jets can have internal structure. 
Although the collinear singularities have been cancelled against the virtual corrections 
to yield a finite result, one remnant of this cancellation is the fact that the theoretical 
prediction inherits a logarithmic dependence on the jet size. Controlling these loga- 
rithms can become important if the value of R is small. For this reason it is possible, at 
NLO, to increase the cross-section by increasing the value of R. The larger the number 
of jets in the final state, the greater the potential difference between the behaviour of 
the LO and NLO predictions for the dependence on the jet size. 

It has already been argued that scale choices based on the variable Hy are well 
motivated for the W + 3 jet process. As the number of jets present in the theoretical 
calculation increases this becomes ever more true: the number of possible relevant 
scale choices grows rapidly, while the definition of Hr tries to capture some of the 
essential hardness of the scattering process in a generic fashion. The results presented 
in this section use a central scale choice of 4 = Hr /2, where the prefactor is chosen for 
a mixture of theoretical and pragmatic reasons. This scale yields well-behaved cross- 
section predictions at both LO and NLO, with K factors near unity [225]. In addition, 
as will be shown in Chapter 9, the use of such a scale results in good agreement with 
LHC data. 

The results of a study of the dependence of the cross-section on the jet algorithm is 
shown in Fig. 4.21. Predictions from the BLACKHAT +SHERPA collaboration [225] are 
shown for both the anti-ky and SISCone jet algorithms, as a function of the jet size 
parameter R. The leading-order cross-section for W + 1 jet is independent of the jet 
size and algorithm because the jet consists of only one parton. At NLO the SISCone 
cross-section is slightly larger than the anti-kr cross-section for the same jet size, as 
expected, since the phase space for two partons to be in the same jet is greater for 
the SISCone algorithm than for the anti-kr algorithm. The NLO cross-section grows 
with jet size since it is more likely to find two partons in the same jet as the jet size 
increases. 

For W + 2 jet production, the LO cross-sections decrease with increasing jet size 
since the two partons must be separated by a distance AR. The effective separation 
is larger for SISCone than for anti-kr (AR = Rj: for anti-ky), as discussed earlier, 
leading to smaller cross-sections for SISCone. For W + 2 jets at NLO there can be 
either two or three partons in the final state, and either one or two partons in a jet. 
Now there are two competing effects with regards to the dependence of the jet cross- 
section on jet size: the AR requirement and the larger phase space for two of the three 
partons to be in the same jet. The second effect wins, with the net result that the 
jet cross-sections increase with increasing jet size, with SISCone being slightly larger 
than anti-kr. 

For three and four jets in the final state the cross-sections decrease with increasing 
jet size at both LO and NLO, with the slope becoming steep at LO. The first (AR) 
effect becomes dominant over the second (phase-space). The anti-ky cross-sections are 
larger than the SISCone ones at both LO and NLO. It is interesting to note that 
while the scale uncertainties increase dramatically with increasing number of jets at 
LO (since each extra jet requires an extra power of a,), the uncertainties at NLO are 
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Fig. 4.21 Cross-sections for W +n jet production at the 7 TeV LHC. Pre- 
dictions are computed at NLO and LO, with jets defined by pr > 30 GeV, 
and are shown ( from top to bottom) for n=1,2,3, and 4 jets (5 jets at LO 
only). The predictions were generated using BLACKHAT +SHERPA ROOT 
ntuples, using CTEQ6.6 PDFs, a central scale of Hr /2, and uncertainties 
obtained by varying this scale by a factor of two in each direction. 


relatively stable. 

The scale dependence for the W + 4 jet cross-section at the 7 TeV LHC is shown in 
Fig. 4.22, for LO and NLO predictions, using the anti-kr jet algorithm. At LO the scale 
dependence is fairly trivial. The cross-sections decrease monotonically with increasing 
scale. The size of the cross-section decreases with increasing jet size since, again, the 
larger the jet size the further the partons are required to be separated at LO. The LO 
cross-sections are smaller for the SISCone algorithm than for the anti-kr algorithm, 
for the same jet size parameter, given the larger effective separation required by the 
SISCone algorithm. 

At NLO, the cross-section behaviour is non-monotonic, with the anti-kp cross- 
sections having a peak cross-section around a scale of Hy /2 (the peak cross-section 
for the SISCone jet algorithm occurs at lower scales, at or less than Hy /4). At this 
scale, for the anti-kr jet algorithm, the K factor is almost exactly 1; this is not strictly 
true for other jet sizes or for the SISCone algorithm, although the K factors do not 
differ greatly from unity for these other choices. Since the scale Hr/2 is at or near 
the peak of the anti-kr cross-section, any scale variations (for example in the range 
Hy /4 to Hr) will be one-sided. At smaller scales, the NLO cross-sections decrease; in 
fact the cross-section for a scale of Hy /8 for the anti-kr jet algorithm is negative. The 
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Fig. 4.22 The cross-sections for W + 4 jet production as a function of the 
renormalization and factorization scale for pp collisions at 7 TeV. The pre- 
dictions were generated using BLACKHAT +SHERPA ROOT-ntuples, with 
CTEQ6.6 PDFs. 


decrease in the cross-sections at low scales becomes milder as the jet size increases (or 
if SISCone is used instead of anti-kr). 

If the NLO and LO cross-sections were evaluated at a scale of about H7/5 (corre- 
sponding to 80 GeV, or approximately my), the K factor would be much larger than 
one, with the K factor increasing as the jet size decreases. This latter behaviour is 
partially due to the divergent behaviour of the LO cross-section and the fact that the 
NLO cross-section decreases more rapidly with low scales for smaller jet sizes. 

An understanding of the dependence of the cross-section on jet algorithm and 
jet size can be crucial when comparing data to theory. For example, in Ref. [227], 
the Z + 3 jet cross-section was calculated at the TEVATRON, using both the SISCone 
and anti-kr algorithms (with R = 0.7 for both). Using a central scale of Hr/2 the 
calculations give 


Fanti-kr = 48.7775 fb, O81$Cone = 40-3755 fb. (4.53) 


At first glance the anti-ky cross-section is noticeably higher than the SISCone one and 
has a smaller scale dependence. However, if the peak cross-section is used for both 
jet algorithms (i.e. Hr/2 for anti-kr and Hr/4 for SISCone), the cross-sections and 
uncertainties become very similar. 
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(a) (b) (c) 
Fig. 4.23 Born-level diagrams representing the production of vector 
bosons Vı and V2 at hadron colliders. Depending on the identity of Vi 
and V2, only a subset of these diagrams contributes to a given amplitude. 


4.4 Diboson production 


The production of pairs of vector bosons at a hadron collider is represented by the 
diagrams shown in Fig. 4.23, where V; and V2 represent either W or Z bosons or 
photons. Depending on the final state, some or all of these diagrams may contribute. 
For example, the production of a Zy pair only includes diagrams (a) and (b), since the 
Z boson is neutral and therefore does not couple directly to the photon. However, all 
of these processes produce final states that are important to measure experimentally, 
for a number of reasons. 

In the first instance these processes provide a wide range of benchmark cross sec- 
tions with which to test the SM. This is illustrated in Fig. 4.24, which shows the 
proton-proton cross-section for each final state as a function of the collider energy. 
Measuring these cross-sections, first at the TEVATRON and then at the LHC, provides a 
demonstration of the ability of an experiment to measure cross-sections that are com- 
parable to new physics effects that are being sought. For instance, the cross-section 
for the Higgs-mediated process, gg > H — WW at 8 TeV, for a SM Higgs boson with 
mass my = 125 GeV, is approximately 5 pb. This is at the same level as the smallest 
diboson cross-sections shown in Fig. 4.24. Moreover, any analysis of the Higgs boson 
decays to W and Z pairs requires an accurate description of these diboson processes 
in order to disentangle them from a potential Higgs boson signal. 

Apart from their use as calibration measurements and their relation to the search 
for the Higgs boson, these processes also provide stringent tests of the non-Abelian 
gauge structure of the theory. This structure manifests itself in the form of the triple- 
boson couplings shown in Fig. 4.23 (c). The possibility of anomalous triple gauge-boson 
couplings (aTGCs), beyond those predicted in the SM, can be investigated by making 
precision measurements of the different diboson cross-sections, as will be discussed in 
Chapter 9. 


4.4.1 Tree-level and one-loop amplitudes for diboson processes 


Once the decays of the vector bosons are taken into account, the matrix elements for 
diboson production are most easily expressed in terms of helicity amplitudes. As shown 
in Ref. [471], the tree-level and one-loop amplitudes for all diboson processes can be 
described in terms of a set of primitive amplitudes that are dressed according to the 
particular process at hand. For the sake of illustration, consider the WW process, 
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Fig. 4.24 NLO cross-sections for diboson production at proton—proton 
colliders as a function of centre-of-mass operating energy, ys. Photons 
satisfy pr > 40 GeV and |y| < 5. 


0 u~ (p1) + a (p2) + £ (p3) + D7 (pa) + C+ (ps) + v (pe) (4.54) 


that is, with all momenta outgoing and the superscripts labelling the particle helicities, 
“—” and “+” representing left- and right-handed helicities respectively. The tree-level 
amplitude for this process can be written as 


2 2 

Atree = (=) õi i2 Pw (834) Pw (S56) Lae + Coyne) (4.55) 
sin” Ow 

where the propagator and coupling factors are denoted by 


Pyw(s) = i 


2 . ? 
s — my +i wmyp 


sı2(1 = 2Qu sin? Ow) 
$12 — me, 


Cru = 2Qu sin? Ow + (4.56) 


Note that the factor CŁ,u contains two terms, representing the coupling of a left- 
handed up quark to an intermediate photon and Z boson. At°®4 and Atree? are 
gauge invariant primitive amplitudes: the coupling structure of Ate- corresponds 
to Feynman diagrams containing electroweak bosons directly attached to the quark 
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line, while A'°®? reflects a triple gauge-boson vertex. These same amplitudes can be 


recycled for other diboson processes; clearly there is no contribution from A'’®? in 
the case of ZZ production. The specific forms of the tree-level amplitudes are, 
(13) [2 5] (6| (2 + 5)/4) 

534 $56 t134 


[K1 3) [25] @1(2 + 5)14) + [24] (16) 81a + 6)15)|, (457) 


Atreesa =i 


Atree.b = ú 
512 534 556 


where tijm = (ki + kj + km): 
At one-loop the amplitude has the same basic structure, 


—loo Qs N? =1 e? : a 
ae (S ( N. ) (= —) Sixi2 Pw (834) Pw (s56) [A + Cru A] (4.58) 


where the one-loop primitive amplitudes A? and A? correspond to all possible dressings 


of the corresponding tree-level primitives. It is convenient to decompose the amplitudes 
further according to 


A® = cr [aney + iF), (4.59) 


and similarly for A?. In this decomposition V contains a divergent contribution that 
is common to both primitives, 


1/ ae Ge ty A a Gea 7 
V= 4.60 
E2 (+) 2e —S812 2’ ( ) 
and F is the finite remainder. Since A?’ corresponds to a single diagram, a vertex 


correction, the result for this piece is very simply written in terms of V alone and 
there is no remainder, 


F’? =0. (4.61) 


The primitive A® is more complicated since it corresponds to box diagrams, in par- 
ticular ones that contain two external momenta that are not light-like. It is given 
by 


PES 


(13)? [25] (21(5 +6962 + DD | Emr tga; S34, Son) 
(3.4) [56] tiga UI +O [84] (56) tiga (2/(5 +. 6)|1)3 | 2 122 1849 234 256 
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BIGA CIE + OM Ay BASE) RIT SN su 


(16) (26) [14]? tisa Lo ( =) 1 (26) [14] (6|(2 + 5)14 | e(a) 
[3.4] (56) (2|(5+6)|1)? s34 2 [3 4] (5 6) (2|(5 + 6)|1)? (—s34)? 


3 (6|(2 + 5)|4)? (—tisa)(—s12) — s34 ; 
4 [34] (56) tiga (2|(5 + 6)|1) toe (sa, ) + Ls4/12 log(—*) = 
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1 (t234812 + 2534556) [4 5)? (3 6)? 
fe 3m 
PIs(s12, 834,856) + 9°75 + )I1)Ag (BaB * G4) (OO) 
(36) [45] (tisa — t234) 1 (+54? ies 
(2|(5 + 6)|1)A3 2 [3 4] (5 6) t134(2|(5 + 6)|1)’ 
where the symmetry operation ‘flip’, defined by 
flip : 162, 365, 466, (ab) [ab]. (4.63) 


should only be applied to the terms inside the brackets ({ ]) in which it appears. The 
amplitude is written in terms of 


612 = S12 — 834 — 856, 634 = 534 — $12 — 856, 656 = S56 — S12 — S34, (4.64) 


as well as the quantity A3, which is an example of the appearance of Gram deter- 
minants in one-loop amplitudes that was discussed in Section 3.3.1. It is given by 


A; = —4 $12 P12 P34 


2 2 2 

= Sip + $34 + Seg — 2812834 — 2812856 — 2834856. (4.65) 
12 4 TS: 12534 12556 34556 

P12° P34 $34 3 96 


The amplitude is written in terms of a basic set of integral functions, some of which 
have already been introduced in Eq. (4.50). The remaining functions correspond to a 
new box configuration that is defined by 


—2mh 2 2 
LsL; (s,t,m7,m3) = —Lig (1 zi) Lis (1-2) 


1 —s 1 —s —s 
log? H= l . (4.66 
zine? (=) + ple (Sa) (Sz): 90 


and a triangle integral with no light-like external legs, 


T3™ (812, 834, 856) = ~The 2(Lia(-p2) + Liz(—py)) + log(pa) log(py) (4.67) 


ve (2) oe (Te) + F 


512 534 2856 (4.68) 


where 


856° 556" ET 
The coefficient of the logarithm reads 
La = 386 (tisa = tesa) GIC + 2)/4) | + 2)15) | 3 (36) MIQ + 2)(3 + 415] 
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and the three-mass triangle coefficient T is given by 
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The primitive amplitudes given here are sufficient to describe all helicity amplitudes 
for the WW process and, after suitable permutations of momenta, all diboson pro- 
cesses. The interested reader is referred to the original paper for further details [471]. 


4.4.2 Basic properties of diboson processes 


Using results for the amplitudes presented above, all of the diboson processes have 
been computed to next-to-leading order [449, 471, 472]. The most recent treatments 
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Fig. 4.26 Leading-order diagrams representing photon radiation in the 
decay of the W (left) and the Z (right). For e~e*y production, the two 
shaded circles indicate that the photon may be radiated from either of the 
electrons in the Z decay. 


have included the effect of single-resonant diagrams [311, 314, 760]. Examples of such 
diagrams, that are required for electroweak gauge invariance when the decays of the 
vector bosons are included, are shown in Fig. 4.25. Such single-resonant diagrams 
are not important in the calculation of the inclusive cross-section for e~ete~e* pro- 
duction; however, they sculpt other distributions, most notably the invariant mass of 
the 4-lepton system. In fact, the presence of just such a contribution was useful in 
cross-checking the first observation of the putative Higgs boson in the decay channel 
H > ZZ > 4 leptons [369]. Finally, a further important contribution to many of the 
diboson production processes originates from gluon—gluon initial states. These are the 
counterparts of the diphoton loop diagram depicted in Fig. 4.13. 

An important issue for the Vy processes is the treatment of photon radiation. 
In particular, the photon is radiated copiously from any leptons produced in the de- 
cay of the vector boson V, for instance via the leading-order diagrams indicated in 
Fig. 4.26. The propensity of the leptons to radiate in this way can be assessed by 
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Fig. 4.27 The ratio of LO cross-sections for the e-e*y and viy final 
states, as a function of the minimum photon pr. 


comparing the cross-sections for e` ety and vy as a function of the photon trans- 
verse momentum. Such a comparison, shown in Fig. 4.27, demonstrates that for suf- 
ficiently high photon transverse momentum, radiation in the decay is effectively re- 
moved. As a result the ratio o(e~ et y)/a(viy) tends to the ratio of branching fractions, 
BR(Z — e-e*)/BR(Z > vv) = 1/6. However, for typical photon pr cuts used at the 
LHC, the ratio exceeds this by up to a factor of two. Therefore the proper inclusion 
of this radiation is essential in order to provide a good theoretical description of the 
whole event sample. Conversely, in probes of the diboson production mechanism, such 
as in searches for anomalous couplings, it is advantageous to suppress the radiation in 
the decay by using a higher photon pr cut. 

The dependence of the WW cross-section, for the LHC operating at 8 TeV, on 
the choice of the renormalization and factorization scale is shown in Fig. 4.28. Since 
this is an electroweak processes the scale dependence of the LO result is very mild, 
originating solely from the factorization scale inherent in the definition of the PDF. 
Of course, this dependence is not indicative of the theoretical uncertainty of the pre- 
diction, as can be seen from the NLO curves. These lie outside the range of the LO 
scale uncertainties due to the O (as) real radiation corrections that are sensitive to the 
gluon PDF. The NLO corrections are quite large, increasing the theoretical prediction 
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Fig. 4.28 The scale dependence of the WW cross-section at the 8 TeV 
LHC. The choice of scale is shown relative to the value uo = mw. At LO 
the cross-section only depends on the factorization scale, zr, while at NLO 
it also develops a dependence on the renormalization scale, wr. 


by approximately 40% and include a 3% contribution from gluon-gluon box diagrams 
that are the counterparts of those shown for the diphoton process in Fig. 4.13. The 
newly introduced renormalization scale dependence is lessened in a full NNLO calcu- 
lation of these processes, for which results are now available [336, 557, 595]. Since they 
must describe the scattering of two massive particles, the two-loop amplitudes that 
enter the computation of these processes [560] represent the current frontier of such 
calculations. 

One of the other ingredients in the NNLO diboson cross-section is the NLO calcu- 
lation of the diboson+jet final state. These processes are interesting in their own right, 
in particular as backgrounds — for instance in jet-binned Higgs boson searches. The 
NLO results are known in all cases [187]. The subtleties discussed for inclusive diboson 
production above again apply with, for instance, gluon—gluon initiated contributions 
once more representing an important higher-order component. 
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4.4.3 Radiation amplitude zero 


Consider the tree-level amplitude for the process, 


0 — u (p1) + d* (p2) + 4 (ps) + D% (pa) + 7+ (ps) (4.71) 


which is relevant for W~ y production, with subsequent leptonic W decay. The ampli- 
tude is [471], 


tee e? Pw (s34) (13) 
A = Va (Sg) Vat Sate aang ayy aR Cs + Qus) (A) 
using the same notation as before, cf. Eq. (4.56), and where Qu = 2/3 and Qa = —1/3 
are the charges of the up- and down-quarks. Note that the amplitude can be written 
in this way despite the presence of the diagram in which the photon is radiated from 
the W boson thanks to the relation Qw- = Qa— Qu. Stripping away overall coupling 
and spinor factors, the amplitude is thus proportional to a simple factor, 


Qu P2: Ps + Qa Pı ` Ps, (4.73) 


that can in turn be readily evaluated in the partonic centre-of-mass. Introducing the 
angle 0* between the photon and the up-quark (positive z) direction in this frame, the 
amplitude is thus proportional to the combination, 


Qu (1 + cos 0*) + Qa(1 — cos 0*). (4.74) 


Quick inspection of this equation reveals that the amplitude exactly vanishes for the 
scattering angle cos 6* = (Qu+Qa)/(Qa—Qu) = —1/3. This vanishing of the amplitude 
for a specific scattering angle is characteristic of this process and a feature of all the 
contributing helicity amplitudes. It has been termed a radiation amplitude zero 
and its features well-known for some time [764]. 

At a hadron collider it is more useful to translate this critical scattering angle into 
the corresponding rapidity, ył. This is easily done, with the result, 


1 1+ cos 6* 
eal zx —0.35. 4. 
Wy = 5 og (152) 0.35 (4.75) 


In order to construct a boost invariant quantity, and thus obtain an observable that can 
be measured in the laboratory frame without recourse to reconstructing the partonic 
centre-of-mass frame, it is usual to consider the rapidity difference, Ay* = y$ —yj,. For 
typical experimental analyses, where most events are observed with p% significantly 
smaller than my, the mass of the W boson means that for this 6* the rapidity of 
the W boson is positive but significantly closer to zero. Explicitly, for small photon 
transverse momentum p7, (relative to my), 


1 mw — pp cos * 
wl 4.76 
Yin © 5 log (= FR (4.76) 
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Fig. 4.29 Left: the distribution of Ay = yy — yw expected in Wy events 
at the TEVATRON, at LO (dashed) and NLO (solid, red) in QCD; the 
radiation zero is manifest as a dip in the distribution around Ay* ~% —0.45. 


Right: the charge-signed (lepton, photon) rapidity difference in Wy events 
observed by DØ at the TEVATRON [97]. Note the qualitative similarity 
between the two plots. Reprinted with permission from [97]. 


so that, plugging in the numerical values, 


y,min 


*” PT 
xy =. 4.77 
Yw 3 ( ) 


Therefore for typical experimental cuts, for instance p% > 20 GeV, the expected zero 
in the distribution of the W-photon rapidity difference is for Ay* ~ —0.45 for the sub- 
process in Eq. (4.71). At the TEVATRON the quark and anti-quark directions correspond 
reasonably well with those of the protons and anti-protons, leading to a prediction for 
the radiation zero in Ay* that is only slightly diluted by PDF effects. At the LHC this 
is no longer the case; instead the radiation zero is expected to be located at Ay* = 0. 
The effect of the radiation zero in W*y production at the TEVATRON is shown in 
Fig. 4.29 (left), for p7"""" = 20 GeV. As expected, the zero is replaced by a dip in the 
distribution that does, however, occur at the expected value of y*. Also shown is the 
impact of the NLO corrections, which tend to diminish the effect of the radiation zero. 
Experimentally one cannot reconstruct the W and, instead, it is simpler to just use the 
lepton rapidity which retains much of the angular information from the W. However, 
it does serve to make the radiation zero less pronounced. Despite this, and further 
contamination from effects such as radiation from leptons in the W decay, evidence 
of the radiation zero has been observed at the TEVATRON [84]. This is illustrated in 
Fig. 4.29 (right), which compares DØ data [97] with the theoretical prediction for 
Qe(ny — ne). As expected, the shape of this distribution is very similar to the NLO 
one shown in the left-panel of the same figure and the excellent agreement between 
the SM prediction and the measurement confirms the presence of the radiation zero. 


4.5 Top-pair production 


The large mass of the top quark compared to all the other fermions, which in particular 
indicates a special role in electroweak symmetry breaking, means that study of top 


Top-pair production 225 


Pi Q Pi Q 


P2 Q P2 Q 


Fig. 4.30 Representative Feynman diagrams for the production of a pair 
of heavy quarks at hadron colliders, via gg (left) and qg (right) initial 
states. 


quark production is of particular significance. The discovery of the top quark was the 
main triumph of the TEVATRON, as discussed in Chapter 8. From a theoretical point of 
view, unlike the calculations performed so far, it is necessary to include the non-zero 
quark mass in order to arrive at sensible predictions. However, it is not only the mass 
of the top quark that is special but also its lifetime. Assuming that the CKM matrix 
has |Væ| = 1, the width of the top quark can be computed by considering only the 
decay t > Wb. At leading order it is given by 


r: 


= a [a = 62)? +w?(1 +82) - 2w] y1 + wt + B4 — 2(w? + B2 +wp), 
T 
(4.78) 


where the dimensionless quantities w and 8 are the masses of the decay products 
rescaled by the top quark mass, w = my/m:, 6 = my/m:. Inserting the known 
masses, this formula gives T; ~ 1.5 GeV, rather small compared to the top quark 
mass. The short lifetime of the top quark therefore sets it apart from the other light 
quarks — unlike them, it is able to decay before hadronizing. As a result there are no 
bound states of top quarks (“toponium”), but the decay products of the top quark do 
allow unique studies of its nature (cf. Section 2.1.4). 


4.5.1 Theoretical considerations 


The production of top quark pairs at hadron colliders proceeds via Feynman diagrams 
such as the ones shown in Fig. 4.30, so that the cross-section is sensitive to the gluon 
content of the incoming hadrons. This helps to explain why the rate for top quark pair 
production is significantly larger at the LHC than at the TEVATRON, to the extent that 
the LHC is often referred to as a “top factory”. The extent to which this factory, the 
LHC, will be able to investigate the properties of the top quark is clear from Fig. 4.31, 
which shows the total number of top quark pairs produced from the early days of the 
TEVATRON through projections for future LHC data-taking. 
The leading-order matrix elements for the two basic partonic processes, 


q(p1) + q(p2) —> t(ps) 4 (pa) (4.79) 


g(p1) + g(p2) — t(ps) + t(pa), (4.80) 


are, after summing and averaging over colours and spins, 
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Fig. 4.31 The total number of top quark pairs produced at the TEVATRON 
and LHC colliders, as a function of year. The dotted (blue) line indicates 
future projections, including estimated shutdown periods, as of 2015. 


2 V (#2 +4? +2m2s 
|Ma = SN? ( z ; (4.81) 
2. 1 /V_2N?\ fa 22,4 95 ani? 
|My] = VN a z t +A + 4mis 7 : (4.82) 
where the Mandelstam variables are defined by § = (pı + p2)?, Ê = (p3 — p1)? — m?, 


and û = (p3 — p2)* — m?. 
In the centre-of-mass frame, the four-momentum of the outgoing top quark can be 
written as, 


pt = (mr cosh y3, Pr, mr sinh ys) , (4.83) 


where pr is the 2-component transverse momentum and mp = Jf m? + p4. In this 
frame the top quark propagator that appears in the left-hand diagram of Fig. 4.30 can 
easily be evaluated. It is given by 


(p3 — p1)? — m? = f = —\/s zımr(cosh y3 — sinh ys), (4.84) 


where the partonic momentum pı corresponds to a probe of the incoming hadron at 
momentum fraction xı. This momentum fraction can be related to final-state kinematic 
quantities through momentum conservation by 


mr 
J5 


so that the propagator can be simplified to, 


(e% + e+), (4.85) 


Ly = 


(p3 — pi)? — m? = -mi (1+ e=). (4.86) 
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Thus the propagator always remains off-shell, since ine > m?. The same reasoning 
applies to all the propagators that appear in the diagrams for top pair production. 
The addition of the mass scale m, sets a lower bound for the propagators — that 
does not occur when considering the production of massless (or light) quarks, where 
the appropriate cut-off would be the scale Agcp. In contrast, as long as the quark is 
sufficiently heavy, mg >> Agen (as is certainly the case for top, and even bottom, 
quarks), the mass sets a scale at which perturbation theory is expected to hold. As a 
result it is possible, for instance, to provide a theoretical prediction for the inclusive 
top quark pair production cross-section. 

Although the large top quark mass therefore provides an easier framework in which 
to perform theoretical calculations, the actual calculations themselves can become con- 
siderably more complex. At the most basic level, retaining non-zero masses for external 
fermions leads to more complicated expressions for the amplitudes. This can be seen 
directly by comparing the squared tree-level matrix elements for gg — tt in Eq. (4.82) 
with those for the process gg > gg in Eq. (4.5). In one-loop calculations of virtual 
corrections, even the scalar integrals are more complex than their massless counter- 
parts. In addition, obtaining analytic expressions for the loop amplitudes themselves 
is more complicated since the spinor helicity formalism for massless particles does not 
readily extend to the massive case. This means that, for instance, the analytic on-shell 
unitarity methods discussed earlier are not immediately applicable to processes con- 
taining heavy quarks. In fact, although massive fermions cannot be defined in terms 
of states of definite helicity, it has been possible to adapt the spinor methods appro- 
priately [683] to obtain amplitudes for some processes of interest, including top quark 
pair production [186]. Note that the numerical unitarity-based methods discussed in 
earlier sections do not suffer from most of these problems and are therefore well-suited 
to calculations involving heavy quarks. 

Theoretical predictions for top pair production cross-sections are shown in Fig. 4.32. 
In addition to the total inclusive cross-section, predictions are also shown for produc- 
tion of an additional jet. Even at 7 TeV there is copious production of additional jets 
with transverse momenta of 40 GeV or more, with such events accounting for approx- 
imately 50% of the total cross-section. At higher energy collisions this fraction grows 
significantly, with about 80% of all top pair events accompanied by a 40 GeV jet at 
ys = 100 TeV. Such a large proportion of events containing an additional jet calls 
into question the pertubative description of these cross-sections. Note though that, at 
a theoretical level, the situation can be easily improved by choosing a higher jet pr 
threshold, for instance 100 GeV. For associated production with two jets, results have 
been obtained using a numerical unitarity approach, dubbed HELAC-—NLO [243, 244]. 
A closely related calculation, for the case where the two associated jets originate from 
b quarks, is important as an irreducible background to Higgs boson production in the 
ttH channel, with H — bb. NLO corrections for this case have been computed by 
several groups [242, 284, 285]. 

One can move beyond the NLO calculation for the total top cross-section in a 
number of ways. In the preceding discussion the top quarks are considered to be stable 
particles in the theoretical calculation. In reality the top quark is only short-lived and 
decays to a bottom quark and a W boson that itself subsequently decays. The decays 
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Fig. 4.32 Top pair production cross-section at proton-proton colliders 
as a function of centre-of-mass operating energy, v/s (solid). Also shown 
are the cross-sections for production in association with a jet satisfying 
pr > 40 GeV (dashed) and pr > 100 GeV (dotted). 


of a pair of top quarks into the different possible combinations of leptons and jets form 
the overall branching fractions shown in Fig. 4.33. Since BR:,w» œ~ 1 due to other 
modes being CKM-suppressed, these branching fractions correspond to the product of 
two W branching fractions to very good approximation. The dilepton decay mode can 
be most accurately measured but only captures a relatively small fraction of top quark 
decays. The converse is true for the “all-jets” channel, so that a good compromise is 
often found by concentrating on lepton+jets final states. This will be explored further 
in Chapter 8 and Chapter 9. 

One way of accounting for the decay is by using a spin-density approach as in 
Ref. [236, 237], which has been used to provide predictions at NLO. Alternatively 
one can directly compute amplitudes including the decay, taking advantage of the 
considerable simplification that occurs due to the left-handed nature of the W inter- 
action [312, 683]. With this type of approach it is possible to include NLO effects in 
both the production and decay of the top quarks [312, 762]. Example diagrams that 
illustrate the division between production and decay stages are shown in Fig. 4.34, for 
the case of the real radiation contribution. The division of the process into production 
and decay stages can be performed only in the limit of top quarks that are produced 
exactly on-shell. The quality of the approximation is then reliant on the fact that the 
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Fig. 4.33 Branching fractions of a top quark pair into leptons and jets. 


(b) 


Fig. 4.34 Diagrams entering the real radiation calculation of top pair 


production in the (a) production and (b) decay stages. The double-barred 
line indicates that the top quark propagator is considered on-shell at that 
point in the diagram. 


top quark width is quite small, with corrections expected of order T';,/m;. In general, 
including NLO corrections in the decay of the top quark has only a very small effect 
on observable quantities. One distribution that is subject to non-trivial corrections is 
the invariant mass of the lepton and b jet produced in the top quark decay, mæ [762]. 
This observable, and closely related ones — such as the invariant mass of the lepton 
and b meson observed in the top decay — are interesting since they could be used 
to measure the top quark mass [256]. This is possible since the distribution shows a 
distinct kinematic edge, as shown in Fig. 4.35, whose position and shape is sensitive to 
m. However, since the position of the edge is dictated by the kinematics of the decay 
process, it can be modified by the inclusion of gluon radiation, as is clear from the 
figure. Hence an extraction of the top quark mass from such a study is best performed 
with the inclusion of NLO effects in the decay. 
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Fig. 4.35 The invariant mass distribution mg, expected at the 7 TeV 
LHC, for typical experimental selection cuts. Predictions are shown at LO 
and at NLO, with the latter computed both with and without including 
NLO effects in the top quark decay. 


Although the calculations that consider top quark production and decay in a fac- 
torized manner can be extended to higher jet multiplicities, the accuracy of the ap- 
proximation — dropping non-resonant and non-factorizable contributions — can only 
be judged in the light of a more complete approach. Calculations of the NLO correc- 
tions away from the resonance regions are available for the final states WtW bb [459] 
and v.e*be~ Deb [245]. Fig. 4.36 shows the three classes of diagrams that must be con- 
sidered in these calculations, where either two, one, or no top quark propagators may 
be resonant. A detailed study [134] of the two approaches in Refs. [459, 762] showed 
that, as expected, the doubly resonant approximation of Ref. [762] is adequate for 
many distributions, resulting in differences of the order of a few percent compared to 
the full calculation in Ref. [459]. However, some observables, such as the transverse 
momentum of the bb pair, differ by as much as 10-20% for pr > 250 GeV. The im- 
proved predictions, throughout a wide kinematic range, provided by these calculations 
enable a better assessment of these backgrounds in new physics searches. 

Another avenue is the calculation of terms in the perturbative expansion beyond 
parton-level NLO. The total cross-section for tt production is known to NNLO [427] 
and first results have been presented for differential distrbutions [265, 428]. The calcu- 
lation of the total cross-section to this order represented a tremendous breakthrough 
in the field of NNLO computations since it was the first calculation involving mas- 
sive quarks. Fig. 4.37 demonstrates that the resulting theoretical prediction is under 
excellent control, with a residual 10% uncertainty that accounts for both scale and 
PDF uncertainties. Note that the accuracy can be further improved in this case by 
resumming large logarithms that appear as a result of producing a top pair close to 
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Fig. 4.36 Example diagrams for the process gg > vee* be ib, illustrat- 
ing three categories that may enter the calculation. Clockwise from top left: 
double-resonant contributions, with two top quark propagators that may 
be on-shell; single-resonant, with only one such propagator; non-resonant, 
containing no top propagator that could be resonant. 


threshold. This type of resummed calculation will be discussed further in Chapter 5. 
Finally, the top quark production process was one of the first for which parton shower 
predictions were available including the effects of NLO corrections [544]. This type of 
prediction will be discussed at length in Chapter 5. 

With the large samples of top quark events that have been collected at the TEVATRON, 
that has already been increased by two orders of magnitude in the 7 and 8 TeV runs 
of the LHC, precision studies of the properties of the top quark can be performed. Two 
particularly interesting examples are observables that are sensitive to the top quark 
mass and those that probe top quark charge asymmetries. These topics are considered 
in more detail in the next sections. 


4.5.2 Top quark mass 


The mass of the top quark has a particularly important role in the SM. Through 
radiative corrections it affects the mass of the W boson and, indeed, precision mea- 
surements of the latter provide an indirect determination of m;. Due to its large mass 
compared to the other fermions, the Higgs boson also couples relatively strongly to the 
top quark. As a result, simultaneous measurements of my, m;, and my can provide 
a stringent test for the presence of BSM physics. 
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Fig. 4.37 Top quark production cross-section at the LHC as a function of 
the operating energy. Reprinted with permission from Ref.[427]. 


150 


Given the importance of the top quark mass, an accurate determination of its 
value directly from experiment is highly desirable. However, the situation is compli- 
cated by the fact that the top mass parameter that appears in the SM Lagrangian, the 
fundamental parameter of the theory, is subject to renormalization at each order of 
perturbation theory. As a result, a perturbative calculation at a given order replaces 
this fundamental parameter, normally referred to as the pole mass mų, by a renormal- 
ization scheme-dependent running mass, m¿(ur). The relationship between the two 
quantities can be computed in perturbation theory and is currently known up to four 
loops [752]. For instance, in the MS scheme the pole and running masses are related 
by 


MS as ag \? 
mi = m®5 (uz) h TEE co ( ) ! adl (4.87) 
T T 
where the coefficients c1, c2 are known. At NLO only the one-loop coefficient is re- 
quired, cı = 4/3 + log[u%/m:(r)*] and, evaluating the running mass at the scale 
LR = My gives, 


30 


Hence the pole mass is approximately 5%, or 8 GeV, larger than the equivalent NLO 
MS pole mass. The distinction between the two quantities is therefore of great impor- 
tance numerically, not just for theoretical consistency. Conventional extractions of the 
top quark mass, for instance in most of the original TEVATRON analyses, implicitly as- 


MS 4 
Mme = mMSNLO (7) (1 + sas) . (4.88) 
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Fig. 4.38 Dependence of the NLO and approximate NNLO top pair 
cross-section on the running top quark mass, m(m). The data point with 
vertical error bars represents the experimental measurement of the cross— 
section given in Ref. [87] and the horizontal error bars the corresponding 
uncertainties on the extraction of m(m) at the two orders of perturbation 
theory. Reprinted with permission from Ref. [720]. 


sume the extraction of the pole mass through kinematic fits of distributions. However, 
due to the fact that the top quark is a coloured object, it suffers from residual theo- 
retical uncertainties in its definition of the order of 1 GeV [860]. A more well-defined 
extraction of the top quark mass may be performed by exploiting the dependence of 
the top pair production cross-section on the mass. The running mass can be extracted 
order by order in perturbation theory by comparing the cross-section prediction, com- 
puted at the same order, with the experimentally measured value. This procedure, 
which was first used in Ref. [720], is illustrated in Fig. 4.38. The resulting NLO and 
approximate NNLO running masses are very consistent and, upon converting to the 
equivalent pole mass, also agree well with other determinations. A number of alter- 
native determinations have been either proposed or implemented, typically based on 
either kinematic endpoints or clean measurements of leptonic top decays; for a detailed 
discussion of methods and projections for the LHC the reader is referred to a recent 
review of this topic [650]. 


4.5.3 Top quark charge asymmetries 


An interesting feature of top pair production is realized for the first time in the next-to- 
leading-order calculation of the inclusive cross-section. In contrast to the leading-order 
prediction, at NLO top quarks are not produced with the same distribution in rapidity 
as anti-top quarks [700]. A sketch of the expected rapidity distributions of top and 
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Fig. 4.39 Sketch of the rapidity distributions of top and anti-top quarks 
expected at the TEVATRON (left) and LHC (right). 


anti-top quarks is shown in Fig. 4.39, from which the asymmetries at the TEVATRON 
and LHC can immediately be seen. This asymmetry arises from corrections to the 
qq-initiated process, as indicated in Fig. 4.40. At the TEVATRON the asymmetry is 
clearest: top quarks are not produced equally in the forward and backward regions, 
where “forward” is defined with respect to one of the beam directions. At the LHC an 
asymmetry does not arise in the same fashion since the strong interaction is parity- 
invariant and the pp initial state is an eigenstate of parity. However, as seen in Fig. 4.39 
(right), there is a difference in the rapidity distributions of the top and anti-top quarks, 
which is due to the sub-leading gq and qg initial states. A charge asymmetry can be 
formed by considering the difference between the distributions in the forward and 
central regions, but it is very small and thus hard to measure, cf. Chapter 9. 

In contrast, the asymmetry at the TEVATRON is a relatively prominent effect that 
will be discussed in more detail. At NLO the asymmetry is induced primarily from 
the interference between the Born amplitudes for gg —> tt 2, and the part of the 
one-loop amplitude that is anti-symmetric under exchange of the quark and anti- 
quark [701, 702]. This contribution arises from box diagrams such as the one shown in 
Fig. 4.40 (right). Note that this is an example of a loop-induced effect that does not 
change the magnitude of the tt cross-section but does affect the kinematic distributions 
of the final state. This type of asymmetry is not unique to the tt system and, in fact, 
it was first discovered in the calculation of QED corrections to the process ete~ —> 
ut po [218]. 

A second type of contribution to the asymmetry arises from diagrams in which 
a hard parton is radiated, as shown in Fig. 4.40 (left). The interference of diagrams 
with initial and with final-state gluon radiation leads to a smaller asymmetry in the 
opposite direction, with a size that depends on the transverse momentum of the top 
quark pair. At large pr (tt) the pair recoils against a radiated hard gluon. In the leading 
colour approximation that is, neglecting contributions of order 1/N?, the gluon is 
colour-connected to either the t-q pair, or the t-q pair (where q, q are the initial state 
quark and anti-quark). Since gluon radiation indicates an accelerating colour charge, 
if the gluon is radiated from the t-q pair then the top quark is more likely to have 


2There is no asymmetry from the gg initial state. 
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Fig. 4.40 Representative next-to-leading order diagrams responsible for 


Q 


the top quark charge asymmetry. A real radiation diagram is shown on the 
left and a loop diagram on the right. 


accelerated in order to change direction and be produced in the direction opposite to 
the incoming quark. Hence, assuming that the quark direction is from the negative 
rapidity hemisphere to the positive one, one can make a qualitative prediction for the 
lab frame asymmetry Af’, that is defined by 


a 


Att. — Z > 9) — olu < 9) 
lab a(t > 0) oly, <0) 


(4.89) 


The asymmetry is expected to be negative for sufficiently hard emission and to in- 
crease in absolute value with p7r(tt).? This expectation is predicated on the assump- 
tion that the beam from which the light quark in the initial state was produced can 
be determined. As such, it is only a useful observable at the TEVATRON where the 
asymmetric collisions between protons and anti-protons allow one to use the proton 
beam as a proxy for the quark direction. The expectations for the asymmetry aris- 
ing from the real radiation contribution is borne out by the NLO prediction for Af‘, 
shown in Fig. 4.41. As seen in the figure, the virtual contribution — that because of 
the 2 > 2 kinematics contributes only at exactly pr(tt) = 0 — has the opposite sign. 
To obtain a more useful prediction at low pr, one can consider the results ob- 
tained using a parton shower. Such a prediction is shown schematically in Fig. 4.41. 
The parton shower prediction interpolates smoothly between large positive values at 
small pr(tt) and negative values at large top quark pair transverse momentum. The 
inclusive asymmetry, obtained by integrating out the dependence on pr(tt), is small 
and positive. One expects that its value should not be altered by the parton shower, 
although further studies have indicated that in fact this is not necessarily the case, due 
to colour connection effects that may be accounted for in the shower treatment [859]. 
Finally, one must remember that, although the asymmetry arises at NLO in the cross- 
section, the non-zero value of the asymmetry is strictly a leading-order prediction. As 
such it suffers from the usual scale uncertainties, exacerbated by the O(a?) nature of 
the observable, resulting in a theoretical uncertainty due to unknown higher orders 
of around 40%. The calculation of the NNLO corrections to the inclusive asymme- 


3This intuitive understanding must be modified somewhat when considering the NLO corrections 
to tt+jet production [763]. The presence of a second scale in the process (pis in addition to mz) 
leads to large logarithms in the ratio of the two scales that reduce the predicted asymmetry to a 
value near zero. This is another lesson in the importance of large logarithms when in the presence of 
two disparate scales. 
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Fig. 4.41 The lab frame top quark forward—backward asymmetry, as de- 


fined in Eq. (4.89), at the TEVATRON, The asymmetry is shown as a func- 
tion of the top quark pair transverse momentum, pr (tt). The NLO predic- 
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tion is shown in red, with a single bin at zero from the virtual corrections, 
and a NLO+parton shower prediction is shown schematically in black. 


try [428] indicate that they are important, at the level of 15-30%. This is crucial in 
helping to reconcile the theoretical prediction with the TEVATRON data, as will be 
discussed further in Section 8.6. 


4.6 Single-top production 


The term single top production encompasses a number of parton level processes that 
all produce a single top quark via the flavour-changing electroweak vertex coupling a 
W boson to a top and bottom quark. Although these processes are suppressed relative 
to QCD production of top quark pairs, since they proceed through an electroweak 
interaction, they are favoured kinematically since a smaller partonic § is required in 
order to produce only a single heavy top quark. The Feynman diagrams depicting the 
three such processes that can be explored at hadron colliders are shown in Fig. 4.42. 

These processes are important to measure experimentally for a number of reasons. 
Foremost, they provide a prototype search for the types of final state that are expected 
in many new physics scenarios. Specifically, they produce bottom quarks, leptons and 
missing transverse energy with rates that are similarly challenging. Moreover, since 
the production cross-section for these processes is proportional to |V}|?. they provide 
the possibility of directly measuring the CKM matrix element Vip. 

The relative importance of the three single-top production processes is illustrated 
in Fig. 4.43. At all energies the t-channel exchange of a W-boson is the dominant 
mechanism for producing a single top quark. At the TEVATRON the proton—anti-proton 
colliding beams mean that s-channel production through a virtual W boson proceeds 
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Fig. 4.42 Representative leading-order diagrams for the three single-top 
channels, s channel (top), t channel (bottom, left), and associated Wt 
production (bottom, right). 


at a significant rate, about half that of the t-channel process. Conversely, the associated 
production of a top quark with a W boson is practically inaccessible at the operating 
energy of 1.96 TeV. At the LHC the situation for the sub-dominant modes is reversed. 
This is due to the fact that it is far more likely to find a gluon than an anti-quark 
in the high-energy proton beams of the LHC. The discovery of single top production 
at the TEVATRON will be discussed in Chapter 8 and the more detailed measurements 
carried out at the LHC in Chapter 9. 


4.6.1 Theoretical issues 


The various single top production channels provide a number of different theoretical 
challenges. The need to retain the mass of the top quark means that the calculations 
are more complex than similar 2 —> 2 processes such as jet production. A further 
difficulty is the fact that the separation of the channels is strictly only possible at the 
first few orders of perturbation theory. To see how this works, consider the partonic 
process 

9(p1) + a(p2) > t(p3) + b(p4) + q' (ps) (4.90) 


that enters at one order higher in the strong coupling than the processes depicted in 
Fig. 4.43. This amplitude receives contributions from Feynman diagrams such as the 
ones shown in Fig. 4.44, which are part of a single gauge-invariant set and so must 
be considered together. From the form of the diagrams it is clear that they could 
enter the calculation of next-to-leading-order corrections to either the s- or t-channel 
processes, but given that they interfere it is not immediately clear how the interference 
should be apportioned. However, the issue can be quickly resolved by inspection of the 
colour structure of the different contributions. Using the labelling in Eq. (4.90), the 
contributions of the two diagrams shown in Fig. 4.44 are 
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Fig. 4.43 Single top cross-section as a function of the hadron collider 
operating energy ys. For ys < 4 TeV the initial state is proton—antipro- 
ton, i.e. appropriate for the TEVATRON, while in the rest of the range it 
is proton-proton, i.e. relevant for the LHC. This causes a noticeable dis- 


continuity in the s-channel cross-section due to the dependence on the 
anti-quark PDFs. 
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where K(®) and K® contain all the kinematic information such as spinors and gamma 
matrices. When these diagrams are squared the result is 
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where the interference term vanishes since the colour matrices are traceless, T;*}, = 
Tr [£7] = 0. Therefore it is possible to simply attribute contributions of the form of 
Fig. 4.44 (left) to the NLO calculation of the t-channel process and Fig. 4.44 (right) to 
the effects of NLO in the s channel. However, this argument clearly relies on the fact 
that one of the fermion lines remains free of any coloured interactions, which is reflected 
in the 6 factors in Eq. (4.91). In the presence of further radiation these can be replaced 
by colour matrices, so that analogous interference effects do not vanish. Therefore the 
clean separation of channels breaks down at NNLO and beyond. Nevertheless, the 
NNLO computation can be performed in the approximation that such contributions 
are neglected and, in this fashion, fully differential results for the t-channel process 
have already been presented [288]. 

Both the t-channel and associated production modes rely on diagrams that contain 
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(a) (b) 
Fig. 4.44 Sample Feynman diagrams for the process q + g > t+b+ g. 
These diagrams may be attributed to the NLO calculation of t- and 
s-channel single top production (left and right, respectively). 


a bottom quark in the initial state. Usually one does not consider any intrinsic bottom 
quark content in the proton; instead it is generated perturbatively from the gluon 
and light quark distributions through evolution equations above the bottom quark 
threshold. Explicitly, at the first order, 


z 2 1 dz 
ha) = S100 (45) | S Pun (Zn?) +0(02) (4.98) 
b x 


which indicates that the leading contribution to the b-quark PDF is from gluon split- 
ting, g — bb, in the proton. Rather than accounting for this effect in the PDF, it is 
often useful to instead directly compute the t-channel and Wt processes with the gluon 
splitting present in the matrix elements. Specifically, an equivalent description of the 
diagrams shown in Fig. 4.42 (b) and (c) is provided by the diagrams shown in Fig. 4.45. 
One motivation for proceeding in this way is that experimental analyses of these final 
states often attempt to separate the single top processes from backgrounds by making 
use of the presence of the additional b quark that is produced in the gluon splitting. 
However, such a calculation can be considerably more complicated. For instance, the 
inclusion of the b-quark mass is necessary in order to render a finite result for the 
diagrams in Fig. 4.45. Without the b-quark mass there would be a collinear divergence 
when the b quark is not explicitly observed, meaning that an inclusive cross-section 
could not be defined. 

Although explicit information about the anti-bottom quark has been lost in the 
original approach, logarithms associated with the splitting have been included to all 
orders in the PDF evolution through the O(a?) terms that are indicated in Eq. (4.93). 
At the first order in the evolution the diagrams with the explicit gluon splitting are 
recovered, albeit in the collinear approximation. If the logarithmic terms, of order 
as log (u?/m?) — with u typically chosen as a physical scale related to the process 
such as pr(b) — are important then this approach may be superior. Of course, as 
calculations in the two schemes are performed at successively higher orders, the results 
of the calculations should agree to a better degree. The often-used nomenclature for 
the two calculational schemes contrasts the number of quark flavours included in the 
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Fig. 4.45 An alternative description of the processes depicted in Fig. 4.42 
(b) and (c). The initial bottom quark has been replaced by a gluon that 
splits into a bottom—anti-bottom quark pair. 


initial state: 4-flavour (4F) for the diagrams of Fig. 4.45 and 5-flavour (5F) when 
computing instead the diagrams of Fig. 4.42 at LO. A comparison of the predictions of 
the two schemes for t-channel single top cross-sections at the LHC is shown in Fig. 4.46, 
both at LO and at NLO. At NLO, once the uncertainty from the choice of scale and 
the PDF sets is included, the two calculations are only in marginal agreement. More 
importantly, the two calculations give access to kinematic quantities at different levels 
of precision. Despite the fact that the 5F calculation is performed to NLO, predictions 
for any properties of the spectator anti-bottom quark only enter the calculation in the 
real corrections. They are therefore predicted only at leading order. In contrast, the 
AF calculation by definition is already sensitive at LO. Indeed, the predictions for such 
properties are identical in the 5F NLO and 4F LO calculations. However, the 4F NLO 
calculation raises the precision of the observable to the NLO level, where significant 
deviations from the LO expectation may be observed. 

A further subtlety arises in the consideration of the associated top production 
process. A careful inspection of the diagrams involved in the NLO calculation of the 
5F process reveals, in addition to the LO 4F diagrams such as the one in Fig. 4.45 
(right), contributions from diagrams such as the one shown in Fig. 4.47. This is none 
other than a diagram for tt production, with the decay f > W~b included. Since such 
diagrams, proceeding through resonant top pair production, provide the dominant 
contribution to the cross-section, a method for effectively excluding them must be 
devised in order to obtain a useful prediction for the single top tW- final state. Since 
gauge invariance requires that all contributing diagrams be included, it is not possible 
to remove them from the calculation. A number of methods have been proposed for 
achieving this goal, for instance applying a cut on the mass of the W~b system in 
order to remove the resonant top mass region, or insisting on a b-jet veto at moderate 
pr. Note that this issue would be present even in the LO calculation of the 4-flavour 
scheme, with the diagram shown in Fig. 4.47 part of the leading-order contribution. 
A more sophisticated treatment is to include all diagrams that contribute to this sort 
of final state, namely pp + et vebu” Dub, and to then see how this compares with the 
approximate separation into various channels. This sort of calculation is now possible 
and the NLO results that have been presented indicate that it is possible to make a 
reasonable approximation in this way [534]. 
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Fig. 4.46 Single top t-channel cross-section at the 14 TeV LHC, computed 
in the 4- and 5-flavour schemes. Scale uncertainty is indicated by the dotted 
lines and the total, considering also the uncertainty from the PDFs, is 
shown as a dashed line. Figure based on the results of Ref. [317]. 


© 


g t 


Fig. 4.47 A diagram entering the NLO calculation of Wt single top as- 
sociated production. It is related by gauge invariance to the one shown in 
Fig. 4.45 (right) and represents the process gg — tt followed by the decay 
t+ Wb. 


4.7 Rare processes 


This chapter has concentrated on the hadron collider processes with the largest cross- 
sections, or ones that lead to formidable SM backgrounds. This section discusses a 
variety of other notable processes that will serve as new benchmarks in the future 
high-luminosity LHC era. The cross-sections for the processes that will be discussed 
here, for the LHC operating at ys = 13 TeV, are shown in Fig. 4.48. Note that most 
of the cross-sections are at the level of a few hundred femtobarns or are smaller still, 
before any branching ratios into observable final states are taken into account. Once 
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Fig. 4.48 Cross-sections, in picobarns, for a selection of rare processes 
at hadron colliders: the production of a top quark in association with a 
vector boson, tri-boson production, and four-top production. The NLO 
cross-sections are taken from Ref. [148]. When photons are present in a 
process, the cross-section corresponds to the basic cuts pù > 20 GeV and 
[n| < 2. 


these are taken into consideration, the expected event rates for many of these processes 
are so small that their observation will require high-luminosity datasets from future 
LHC runs. 

For the first category of processes in the figure, top pair production in association 
with a vector boson, the outlook is not so gloomy. Indeed, first evidence for produc- 
tion of these final states at the SM level has already been established in Run I of the 
LHC [46, 175, 403, 406]. These processes are important for a number of reasons. All 
three represent significant backgrounds to multi-lepton signals that, for instance, are 
mainstays of SUSY searches. In particular, the ttW=® and ttZ processes constitute a 
non-trivial source of same-sign dileptons within the SM. The ttW= final state also 
exhibits a charge asymmetry analogous to the one already discussed for top pair pro- 
duction; in this case the emission of a W boson effectively polarizes the top quarks, 
leading to a significant O(15%) charge asymmetry that should be observable [738]. 
In contrast to the case of ttW* production, where the W boson has only an indirect 
effect on the top quarks, ttZ production directly probes the coupling of the Z boson to 
top quarks. Direct constraints on the nature of this coupling should be possible with 
future LHC data [829], where a precise knowledge of the SM cross-section that takes 
into account both NLO QCD and electroweak effects [541] will be invaluable. 

The second category of processes comprises final states containing three vector 
bosons: “tri-boson” production. Once again, they constitute important backgrounds 
for not only BSM searches, but for ongoing probes of the Higgs boson. For example, 
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some of these processes represent irreducible backgrounds to the associated production 
modes of the Higgs boson, pp > WH and pp — ZH when the Higgs boson decays to 
vector boson pairs. Besides this important role, these processes have intrinsic interest 
due to their ability to probe anomalous gauge boson couplings. Although triple gauge 
couplings are probed — in general, much more effectively — by diboson processes, 
quartic couplings are probed for the first time in tri-boson processes. The production 
rates for these processes, before any branching ratios, are all rather similar and in 
the 10-100 fb range. Branching ratios into clean final states that could be identified 
experimentally reduce these to the femtobarn level or smaller. At that level it is clear 
that, unless anomalous interactions are very large, detailed investigation of these final 
states will not be possible until hundreds of inverse femtobarns of data have been 
analysed. 

The final process shown in Fig. 4.48, with a cross-section of only about 10 fb, is 
four-top production. Although this cross-section is tiny, even before top quark decays 
are accounted for, this process would have a spectacular signature in the detectors. 
The heavy nature of the top quark means that many models of new physics beyond 
the SM involve additional interactions of the top quark with new degrees of freedom. 
As a result such models usually predict a much-enhanced cross-section for four-top 
production. This means that, despite the small rate expected in the SM, this final state 
has already attracted considerable interest in the context of BSM searches. Despite 
the complexity of this final state — containing four coloured, heavy particles — NLO 
predictions for this cross-section in the SM have been known for some time [246]. 

Another class of rare processes that will be extensively probed with future data 
from the LHC corresponds to vector boson scattering. These will be discussed in Sec- 
tion 4.8.8 below. 


4.8 Higgs bosons at hadron colliders 
4.8.1 Overview 


The discovery of a new particle in 2012, consistent with the predictions of a SM Higgs 
boson, represented the culmination of many years of careful experimental and theo- 
retical study. One reason for this is simple — the SM does not provide a prediction for 
the mass of the Higgs boson. This means that dedicated searches had to be performed 
over a wide range of potential masses. On the other hand, once a mass is assumed 
the SM is very predictive in the sense that its interactions are completely prescribed. 
This means that, at hadron colliders, the Higgs boson may be produced, and subse- 
quently decay, via many mechanisms. The theoretical study of the various final states 
in which a Higgs boson may be detected, and the development of suitable experimental 
techniques with which to extract a viable signal, has been a driving force in hadron 
collider studies for many years. Due to the expansive nature of the topic it is not 
possible to discuss all the intricacies of the search for a SM Higgs boson here. Instead, 
this section will focus on the main themes that have emerged from these studies and 
concentrate on the issues of particular relevance in the era following the LHC discovery 
of a Higgs boson with a mass of approximately 125 GeV. Select experimental results 
will be discussed in more detail in Section 9.6. 
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Fig. 4.49 Feynman rules for the coupling of the SM Higgs boson to W 
and Z bosons (left) and to fermions (right). 


To begin, it is useful to review the ways in which the Higgs boson, in its usual 
incarnation, may couple to other SM particles. Since the particle was originally intro- 
duced as the agent of electroweak symmetry breaking, responsible for giving the W 
and Z bosons non-zero masses, the Higgs boson has tree-level couplings to those par- 
ticles. The interactions and the corresponding Feynman rules are shown in Fig. 4.49 
(left). The nature of the Higgs boson means that all the couplings are proportional 
to the masses of the particles with which it interacts (recall that, at tree level, the W 
and Z masses are related by mw = mz cos ĝw). If these were the only interactions of 
the Higgs boson then opportunities for its observation at a hadron collider would be 
limited to channels with particularly small cross-sections, since production would pro- 
ceed through Feynman diagrams with multiple powers of the weak coupling. The two 
such mechanisms that are of primary importance are shown in Fig. 4.50. Associated 
production refers to the modes in which the Higgs boson is produced together with 
an additional W or Z boson. The last production mode, referred to as weak boson 
fusion or vector boson fusion, is especially interesting since it has a very clear 
experimental signature. The quarks only receive a moderate transverse kick (typically 
of order my /2) in their direction when radiating the W or Z bosons, so they can be 
detected as jets very forward and backward at large absolute rapidities. At the same 
time, since no coloured particles are exchanged between the quark lines, very little 
hadronic radiation is expected in the central region of the detector. Therefore the type 
of event that is expected from this mechanism is often characterized by a “rapidity 
gap” in the hadronic calorimeters of the experiment. 

Although not strictly part of the original formulation of the Higgs boson interac- 
tions, it is usual to consider the SM Higgs boson as one that also provides a mechanism 
by which fermions acquire masses. The Higgs boson then has Yukawa interactions with 
all the fermions, with a strength proportional to the fermion mass, my. This interac- 
tion is shown in Fig. 4.49 (right) and, in the SM, it is the coupling to the top quark 
that is especially relevant due to the large top quark mass. In particular it leads to the 
channel in which the Higgs boson is produced in association with a pair of top quarks, 
through leading-order diagrams such as the ones shown in Fig. 4.51. 

Finally, but most importantly, the tree-level interactions described above lead to 
couplings of the Higgs boson to light particles that are mediated by loop diagrams. 
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Fig. 4.50 Feynman diagrams for the production of a Higgs boson through 
electroweak interactions alone. The three production modes are associated 
production of W~H (left) and of ZH (centre) and weak boson fusion 
(right). 


Fig. 4.51 Representative Feynman diagrams for the production of a Higgs 
boson in association with a top quark pair. 


For hadron colliders the key coupling is that of the Higgs boson to two gluons, through 
loops of heavy quarks, as shown in Fig. 4.52. Since there is no tree-level coupling of the 
Higgs boson to two gluons it is clear that there is no way to renormalize any possible 
divergence that this loop diagram could contain. Therefore its contribution must be 
finite. Explicitly, the colour- and spin-averaged matrix element squared is given by, 


(4.94) 
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where v? = (/2G)~! = (246 GeV)? is the squared vacuum expectation value of the 
Higgs field. The function I,(x) is defined by, 


I, (x) = 4x [2 + (4z — 1)F(z)], (4.95) 


where the loop function F(x) is sensitive to whether the top quark in the loop is above 
or below the pair-production threshold, 


Fle) = fe [log (1 + VI = 42)/(1— VI—4a)) - in]? z< (4.96) 
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Although the matrix element is formally suppressed by two powers of a, and a loop 
factor, in the calculation of the cross-section the gluon parton distribution function 
enters twice. Thus, despite the loop suppression, this coupling actually results in the 
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Fig. 4.52 The one-loop diagram representing Higgs production via gluon 
fusion at hadron colliders. The dominant contribution is from a top quark 
circulating in the loop, as illustrated. 


largest cross-section at the LHC. This mode is usually referred to as Higgs production 
by gluon fusion. 

Before going on to discuss each of the production modes in turn, it is useful to 
consider the size of their cross-sections. Fig. 4.53 shows the cross-sections for each of 
the Higgs production processes at a pp collider, as a function of the c.o.m. energy. The 
gluon fusion mode is larger than all other modes combined by an order of magnitude 
across the range shown, with a cross-section in the range of a few tens of picobarns at 
LHC energies. In contrast, associated production with top quarks is the mode with the 
smallest cross-section, a few hundred femtobarns. However, the relative importance of 
this mode grows far more rapidly with ys than the VBF and associated production 
modes. This is mainly due to the fact that this channel benefits from the gluon flux that 
is increasingly important as the typical Feynman-x value that is probed decreases as 
y's rises. Despite this fact, the VBF mode remains the second-largest cross-section at 
all foreseeable future hadron collider energies. Also shown, for comparison, is the Higgs 
boson pair production cross-section. This cross-section is so small that, even with the 
full 3000 fb7! of integrated luminosity anticipated at the LHC in the future, one could 
only expect to produce about 10,000 Higgs boson pairs in total. After accounting 
for Higgs boson branching fractions and experimental efficiencies, this leaves very 
few events to analyse. Although this cross-section also rises sharply with ys, based 
on cross-sections alone one might expect it as difficult to observe Higgs boson pair 
production at a 100 TeV collider as to definitively establish the associated production 
modes of the Higgs boson at the LHC. 


4.8.2 Higgs boson decays 


An experimental observation of the Higgs boson can of course only be inferred from 
a measurement of the particles into which it decays. The Feynman rules in Fig. 4.49 
also represent the tree-level processes by which the Higgs boson decays into a pair 
of W or Z bosons, or a fermion—anti-fermion pair. Although the mass of the Higgs 
boson that has been observed at the LHC is below the threshold for two on-shell vector 
bosons, the partial width is still significant. For my < 155 GeV, the decay can be well 
described by considering only one of the vector bosons off-shell, the decay H —> V V*. 
The partial width for such a decay, including a factor of two to account for either one 
of the bosons to be off-shell, is given at tree level by, 
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Fig. 4.53 Cross-sections for the production of a SM Higgs boson of mass 
125 GeV at a pp collider, as a function of the operating energy, ys. 
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and the symmetry factor accounts for two identical Z bosons, Sz = 2 and Sw = 1. 
For a heavier Higgs boson above the diboson threshold this reduces to the simpler and 
more well-known result (cf. the NWA given in Eq. (2.83)) 
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Fig. 4.54 Representative Feynman diagrams for the loop-induced cou- 
pling of a Higgs boson to two photons. 


after the factor of two that is no longer appropriate is removed. 
The partial width into a fermion pair is generated at tree level and is straightfor- 
wardly given by, 
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T(H > ff) = d; Z f ( i) , (4.100) 
where dy is a factor representing the additional possible colour degrees of freedom 
for the fermion. For quarks df = 3, the number of colours, while for leptons dy = 1. 
Importantly, the branching ratio in Eq. (4.100) is proportional to the mass-squared of 
the fermion, so that in the SM the largest fermionic branching ratio is to a pair of the 
heaviest kinematically possible quarks, bottom quarks for mz = 125 GeV. The partial 
width into tau pairs is also significant since it provides a non-negligible coupling to 
leptons. The widths into pairs of charm quarks or muons are very small but could 
possibly be probed with very large datasets in the future. Decays into lighter quarks 
and electrons are too rare to have any chance of observation at the LHC. 

The gluon-fusion production diagram, illustrated in Fig. 4.52, can also be read as 
the one-loop decay of the Higgs boson into a pair of gluons. This partial width is not 
negligible, accounting for about 10% of all Higgs boson decays for my = 125 GeV. Of 
course, the resulting events are rather difficult, if not impossible, to observe directly 
at a hadron collider due to the fact that jet backgrounds are overwhelming. 

On the other hand, a very important loop-induced Higgs boson decay is the chan- 
nel H — yy, which receives contributions from diagrams such as the ones shown in 
Fig. 4.54. Although the rate for this decay is suppressed by two powers of the elec- 
tromagnetic coupling and a loop factor, it is very important because the two photons 
can be reconstructed very cleanly in the detector. The squared matrix element for this 
decay is 
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(4.101) 
In this equation the contribution of the top, bottom, and W-boson loops is shown 
separately, in terms of the function [,(x) already introduced in Eq. (4.95) and 


Tw(x) = —2[(6x + 1) + 6x(2x — 1)F(2)}. (4.102) 
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The function F(x) is defined in Eq. (4.96). It is instructive to evaluate these contri- 
butions at their physical values, in this case using mẹ = 4.5 Gev, m, = 175 GeV, 
my = 80.4 GeV and a Higgs boson mass of 125 GeV. Writing the terms in the same 
order as in Eq. (4.101) the result is 


ie ef Grmi; 


= a , 2 3 
= Ten? 8V2? |(1.838) + (—0.016 + 0.0192) + (—8.323)|". (4.103) 
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The W-boson and top quark contributions enter with opposite signs and therefore 
interfere destructively. The contribution of the bottom quark loop is complex (cf. the 
behaviour of the function F(x) in Eq. (4.96)) and, as expected, is very small. However, 
it affects the cross-section at the 0.5% level due to interference with the rest of the 
amplitude. Beyond its importance to establishing a clear experimental signature of the 
Higgs boson, the H — yy decay is also particularly interesting from a theory point 
of view. Since the partial width is very small, it is rather sensitive to new particles 
that could couple to the Higgs boson and circulate in the virtual loop. Moreover, the 
threshold behaviour of the resulting contribution, cf. Eq. (4.96), could in principle be 
observed experimentally. Note that the closely related process, which proceeds through 
almost identical loop diagrams and thus is qualitatively rather similar, is the decay 
H — Zy. However, this mode results in an even smaller branching ratio once the 
decay of the Z boson is also folded in. 

Although the mass of the Higgs boson has now been determined at the LHC, it is 
useful to review the pattern of branching ratios that was expected prior to its discovery. 
As shown in Fig. 4.55, which depicts the branching ratios for a SM Higgs boson as 
a function of its mass, the most important decay channels are strongly dependent 
on the Higgs boson mass. This basic fact necessitated a very broad range of search 
strategies at the LHC. Note that these results include various higher-order QCD and 
electroweak corrections, although the basic pattern is very similar to the one that 
would be obtained using the lowest-order formulae above. As is clear from the figure, 
the branching ratios to the different decay products are very sensitive to kinematic 
thresholds, unlike the very smooth mass dependence observed in the Higgs boson 
production rates. Over the mass range shown, the decay of the Higgs boson to bottom 
quarks dominates for my < 135 GeV and, above that value, it is the WW branching 
ratio that is largest. Above 200 GeV the decays into WW and ZZ remain the most 
important channels, with a small contribution from H — tt that is as large as 0.2 for 
my ~ 450 GeV. It is interesting to note that the dominance of the WW branching 
ratio is true even below threshold, with at least one of the W bosons produced far from 
mass-shell. The presence of all of these features results in a very rich phenomenology 
and a difficult experimental task, with search analyses fine-tuned for different putative 
Higgs masses. The ZZ — 4 leptons and yy decay modes benefit greatly from the 
excellent identification and resolution of the detected particles. For the decay H — 
WW — 242v, the analysis is hampered by the missing transverse momentum which 
means that the candidate Higgs boson mass cannot be directly constructed. Although 
the transverse mass may be used as a substitute, the resulting resolution is not as 
good as in the fully reconstructed cases. Any of the decay modes involving jets of 
hadrons suffer from similar resolution issues but are complicated foremost by the fact 
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Fig. 4.55 The branching ratios of the SM Higgs boson, as a function of 
its mass, taken from Ref. [468]. 


that they must compete with large QCD backgrounds. This is especially true for the 
decay H — bb, with b jets produced prolifically in both pure QCD processes and in 
top quark decays. 

Following the discovery of a Higgs-like boson at the LHC in 2012, it is useful to 
focus specifically on the case my = 125 GeV. The branching ratios for this mass 
are shown in Table 4.2 which is taken from Ref. [460]. For each decay the table also 
includes the uncertainties due to variation of input parameters (a, and heavy quark 
masses) and due to uncalculated higher orders. The order in perturbation theory at 
which the branching ratio is known is also included, for completeness. One additional 
branching ratio is shown in this table that did not appear earlier: the one for the decay 
H > wt p-. The branching ratio for this mode is an order of magnitude smaller than 
that for H —> yy. Since both final states are well-reconstructed and also have smooth, 
understood backgrounds, one might expect that the luminosity required to observe 
H — w*p should be a factor of 100 larger than that needed for H —> yy. A more 
detailed analysis supports this rough estimate, with hopes that this rare decay mode 
could indeed be observed with several inverse attobarns of LHC data [401]. 


Higgs bosons 251 


Table 4.2 Branching ratios of a SM Higgs boson of mass 


mu = 125 GeV. The corresponding width is ly = 4.07 MeV. 
Decay mode Branching ratio Order of calculation 

bb 0.577432% N4LO QCD + NLO EW 
ww 0.215473 NLO QCD + NLO EW 
gg 0.0857+19:2% NLO QCD + NLO EW 
TT 0.063215:7% NLO EW 

ce 0.0291112:2% NtLO QCD + NLO EW 
ZZ 0.0264743% NLO QCD + NLO EW 
yy 0.00228+5:9% NLO QCD + NLO EW 
Zy 0.00154+3:9 LO 

uu 0.00022+8:0 NLO EW 


4.8.3 Width of the Higgs boson 


The predicted total width of the Higgs boson is given by summing over all contribut- 
ing partial widths so that, for instance, the simple expressions given previously can 
be used to obtain the leading approximation for the total width. For the boson dis- 
covered at the LHC, with a mass around 125 GeV, the total width is dominated by 
the decay into bottom quark pairs, cf. Table 4.2. The total Higgs boson width is thus 
well approximated, up to a O(1) factor corresponding to BR y5, by Eq. (4.100) with 
my — my. Compared to the widths of its partner electroweak bosons the W and the 
Z, the width of the Higgs boson is thus suppressed by a factor m?/mj,. This results 
in a SM prediction of Ty + 4 MeV for a Higgs boson of mass 125 GeV. 

This width is much smaller than the typical mass resolution of the LHC experiments, 
even in the best-measured channels such as H > yy and H — ZZ. Therefore a 
typical scan of the threshold region does not yield very much information about the 
intrinsic width. This type of direct scan yields only a rather weak bound at present, 
Ty <1600xI$™ [402], while an estimate of the eventual sensitivity using this method 
is at the level of Ty < 50 x rM [439]. A number of other methods to constrain the 
width directly have been proposed, relying on either interference effects that alter the 
shape of the mass distribution in diphoton decays [473] or on a comparison of Higgs- 
related cross-sections at the resonance and in the high-mass region beyond it [322]. 

The latter method rests on the observation that there is a significant fraction of 
Higgs boson events in the ZZ final state in the region m(ZZ) > my. This feature can 
clearly be seen in Fig. 4.56, which shows the cross-section as a function of the 4-lepton 
invariant mass for ZZ — 4l decays, for typical LHC cuts at 13 TeV. The large off- 
shell contribution is the result of two effects. First, the branching ratio BRy_,77 grows 
significantly as the virtuality of the Higgs boson approaches the threshold for producing 
two real Z bosons. Second, there is an additional enhancement of the spectrum in 
the region mag ~ 2m, when there is sufficient energy to resolve the internal top 
threshold in the loop. The result is that, for typical lepton cuts, the predicted number 
of H + ZZ — 4-lepton events in the off-shell region defined by mae > 130 GeV can 
be 15% or more [315, 322, 654]. However, this is not the full story, due to the fact that 
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Fig. 4.56 The 4-lepton invariant mass distribution expected from 
gg + ZZ — Ae for typical CMS cuts at 13 TeV. The contribution from 
the Higgs diagram alone is shown in blue, while the total contribution of 
all gg-initiated diagrams is shown in red. 


there is another class of diagrams that contributes to this final state at the same order 
in perturbation theory. These are indicated in Fig. 4.57 (right) and correspond to box 
diagrams in which a quark circulates in the loop. In contrast to the Higgs diagram on 
the left, the contribution of light quarks in the loop is non-negligible. The inclusion of 
the box diagrams has an important consequence in the off-shell region due to the fact 
that the two sets of diagrams interfere destructively at high energies. This behaviour 
is a consequence of the unitarizing effect of the Higgs boson on the production of 
longitudinally polarized Z bosons [723]. The effect of the destructive interference is 
clearly seen in Fig. 4.56, where the contribution of all diagrams is smaller than the 
contribution of Higgs diagrams alone in the region mae > 700 GeV. Information on 
the width of the Higgs boson may be obtained by noting that the peak cross-section 
is related to both the couplings and the width of the Higgs boson, while the off-shell 
cross-section does not depend strongly on the width. Such methods provide much more 
stringent limits than direct approaches, [yy < (5—9) x T$M [37, 661]. The experimental 
situation is discussed further in Section 9.6.4. 

In addition to these direct constraints, the total width of the Higgs boson can be 
determined indirectly by comparing measurements of its couplings in different chan- 
nels. Converting these measurements into constraints on the maximum width requires 
additional theoretical assumptions, for instance bounds on the couplings of the Higgs 
to W and Z bosons that are allowed in broad classes of SM extensions [475]. With 
this caveat, the indirect bounds can be rather strong: 0.3 < Cele < 3.56. 
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Fig. 4.57 Representative diagrams entering the calculation of 
gg — ZZ — A€ production, corresponding to Higgs contributions 
(left) and non-Higgs continuum box diagrams (right). 


4.8.4 Higgs boson production in gluon fusion 


As already discussed, the gluon fusion process representing the production of a Higgs 
boson, gg — H, does not occur at tree level but instead appears for the first time 
at one-loop. When one starts to consider higher orders in perturbation theory or the 
radiation of additional hard jets this makes the relevant calculations correspondingly 
more difficult and, in some cases, impossible to perform at present. 

For this reason it is convenient to formulate the diagram in Fig. 4.52 as an effective 
coupling of the Higgs boson to two gluons in the limit that the top quark is infinitely 
massive. This corresponds to the effective Lagrangian, 


s 11 as 
fe = (1+ k ) H tr Guu G”” + O(a2), (4.104) 


where the trace is over the colour degrees of freedom. Using the resulting ggH coupling, 
it is straightforward to compute the LO matrix element squared, 


2 i ræs? 
(LO) |* _ s 4 
M == (=) m4. (4.105) 
Alternatively, the same expression could have been derived by taking the limit m:/mx# —> 
oo in the corresponding expression for the same matrix elements, Eq. (4.94).4 By tak- 
ing the ratio of these two expressions one finds that the matrix element squared, and 
hence the cross-section, in the full theory is related to the one in the effective theory 


by the factor 
2 


R= —Vhull = |n (m? /m2,) (4.106) 


effective 4 


The value of this ratio is shown in Fig. 4.58, for Higgs boson masses between 100 
and 300 GeV. Although formally one would expect that this approximation is valid 
only when all other scales in the problem are much smaller than mų, in fact one finds 
that my < mų is sufficient to render the effective theory valid to within 10%. For a 
Higgs boson of mass 125 GeV, R corresponds to a correction factor of about 6.5%. In 
more complicated applications of the effective theory, for instance in the presence of 
jets, it has been found that the requirement pr(jet) < ms is necessary for an accurate 
approximation [454]. 


“In this limit it is easy to show that the function Ig(m?/m?,) > 4/3. 
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Fig. 4.58 The ratio, R of the LO cross-section for Higgs production 
through gluon fusion calculated in the full theory to the result in the 
effective theory. The ratio is given by the expression in Eq. (4.106). 


The clear benefit of the effective theory is the ease with which higher-order cor- 
rections can be computed. The NLO corrections were first computed long before the 
discovery of the Higgs boson [438], with the calculation performed in a very similar 
style to the Drell-Yan case discussed earlier in Sections 3.3.2 and 3.3.3. The contribu- 
tion of the loop diagrams is, as in the Drell-Yan case, proportional to the leading-order 
matrix element. Explicitly, using the dimensional reduction scheme, the result is 


R (1—loop) (LO)] Ne ( PN 2 2 (LO) |? 
e|M x M = > cr Soa te Mosh (4.107) 


gg>H gg>H or m2, 


Since the LO amplitude contains the strong coupling, it must be renormalized at this 
order. This is achieved by adding a term, 


bo As 2 
—2— cr — 
E 


(LO) 
2T ua 


gg-H| >? 


(4.108) 
where the overall factor of two reflects the O(a?) nature of the LO matrix elements. 
In addition there is a finite renormalization of the strong coupling associated with the 


use of dimensional reduction, to bring its definition into the standard MS scheme, 


LO 
= |My al . (4.109) 
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Finally, the full calculation should also include the O(a,) correction to the effective 
Lagrangian shown in Eq. (4.104), which it is natural to include here. Accounting for 
all of these contributions, the renormalized virtual result is 


as Ne 2 (uN? 2b 3 (Lo) |? 
= «| (£) ey Pate nee (4.110) 


The real corrections from the process gg —> Hg are easily computed using either an 
explicit calculation or making use of the Catani-Seymour dipoles that were introduced 
in Section 3.3.2. The derivation of these contributions is very similar to the calculation 
of the closely related Drell-Yan process, that was explicitly worked out there. The full 
result is obtained by adding the real radiation corrections and the appropriate PDF 
collinear subtractions to the result in Eq. (4.110). In this way one arrives at the 
expression for the NLO corrections (cf. Eq. (3.219) for the Drell-Yan case), 


1 
asNe LO 0(a1x28 —m? ) 
doses (Po Pos) = MEO), J asida Oltita — MH) 


| gg—> 2 
27 J Mir 


1 1 
x 11 4r? 
fas fa fajba (2. n) falha (Ẹ, n) E 3 | 6(1 — £1) (1 — £2) 


2 ie ee (= z E 2) ( (l= ee 
+ (= log TA . +2 z 2+& — & | | log A 
_¢,)3 
+5 | ou - a) 
2 ds =i] l ( eee z) ( (l= aa 
vi (Hs me oup + Ee £2 BASN a oup 
gy3 
+ q a 6(1 a}. (4.111) 


To compute a complete set of corrections at this order one must also include contri- 
butions from the real radiation diagrams, gq —> Hq, and all crossings. These contain 
no virtual contributions and their calculation is more straightforward. The net effect 
of the NLO corrections is large, which can already be seen from the coefficient of the 
LO-like term, proportional to 6(1 — €) 6(1 — 9), in Eq. (4.111). 

The size of the corrections motivated the calculation of NNLO corrections to the 
Higgs boson cross-section using the same effective theory approach [158, 615], as al- 
ready discussed in Section 3.4.1. The inclusion of the NNLO terms provided only a 
relatively small further correction, thus stabilizing the perturbative expansion of the 
cross-section. However, the residual scale uncertainty remained at the 10% level until 
the recent completion of the full NLO calculation [157]. The results of that calculation 
are illustrated in Fig. 4.59, which shows the cross-sections and uncertainties at each 
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Fig. 4.59 The scale uncertainty of the Higgs gluon fusion cross-section, 
computed in the range [ma /4, Mmg], versus the collider energy, ys. The 
cross-section is computed at LO, NLO, NNLO, and N?LO in the strong 
coupling. Reprinted with permission from Ref. [157]. 


order of perturbation theory. At NLO both the size of the correction, and the total 
scale uncertainty, are at the level of a few per cent. This has an immediate impact 
on the precision Higgs programme of the LHC, for example by significantly reducing 
the theoretical uncertainty associated with the extraction of the Higgs boson coupling 
strengths. Other improvements in the accuracy of the theoretical prediction can also 
be taken into account. For instance, the predictions shown in Fig. 4.53 also include 
contributions from electroweak corrections [123]. These represent a few per cent ef- 
fect but introduce a small additional uncertainty since one must choose a scheme for 
combining the separate QCD and electroweak corrections. 

An important issue in detecting events in which a Higgs boson is produced in this 
channel is the presence of additional jet activity. There are many significant back- 
grounds to Higgs production that naturally contain jets so that it is natural to try 
to use the jet information to better separate the signal and background processes. 
One example of this is top pair production, which has a considerable cross-section and 
leads to final states containing two W bosons and two jets and is thus a background 
to searches looking for the decay H — WW. Calculations of the signal rate in associ- 
ation with additional jets have been performed at NLO for up to three jets [420] and 
at NNLO for H +1 jet [270, 271, 392]. As a further example of the stabilizing effect of 
higher orders in perturbation theory, the cross-section for Higgs+jet at LO, NLO, and 
NNLO is shown in Fig. 4.60, as a function of the transverse momentum required to 
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Fig. 4.60 The NNLO cross-section for Higgs production in association 
with a jet, as a function of the minimum jet p] . Reprinted with permission 
from Ref. [271]. 


define the jet. However, the real issue is that all these higher-order calculations yield 
predictions for cross-sections that include at least the number of jets present in the 
leading-order process. That is, the NLO calculations explicitly include up to one addi- 
tional jet and the NNLO ones up to two. However, the signal discrimination requires 
predictions for an exact number of jets — and no more than that. In the simplest 
case, one is thus interested not in the inclusive Higgs cross-section but the cross-section 
for Higgs production with zero jets, that is, in the presence of a jet-veto. However, 
this type of veto on additional radiation can render the fixed-order perturbative cal- 
culations more unreliable. Particularly if the jet-veto scale is much smaller than the 
Higgs boson mass, as is usually the case, the perturbative expansion develops large 
logarithms of the form log(p\°*°/my,). In order to recover accurate predictions in this 
case it is necessary to go beyond the fixed-order approach and perform a resummation 
of these types of logarithm. Such calculations will be discussed further in Chapter 5. 


4.8.5 Amplitudes for H+jet production 


Although first results for the H+jet process are now available at NNLO, these have 
all been computed in the effective theory. As already noted, these calculations are 
expected to break down for values of the jet transverse momentum larger than about 
my. For this reason, it is still useful to consider lower-order perturbative predictions 


5This is not the case for other decay modes such as H > yy and H —> ZZ* where inclusive 
measurements can be carried out. See Section 9.6. 
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in the full theory. Of course, in the full theory even the lowest-order prediction already 
contains loop diagrams and so the resulting matrix elements contain the usual one-loop 
functions. These matrix elements were first computed in Ref. [500] and the results of 
that calculation are summarized here. 

There are two basic parton-level processes that must be considered: 


(a): O- g(p1) + d(p2) + q(p3) + A, (4.112) 
(b): O-— g(p1) + g(p2) + g(p3) + H. (4.113) 


The amplitude for process (a) is relatively simple. It can be written as 


1672 4mw 2° 78 


EEA (e = cuit) €a (pı)F (s23, 5H) 


2 
g 1 
Mosg = a A) (4.114) 


$23 pi: (p2 + ps) 


where €,(p1) is the polarization vector of the gluon, sy = (pı +p2+>p3)? and the loop 
function F(s23, 87) for a single quark of mass mg is given by 


F (893, 8H) = —8m2]2 — (su — s23 — 4m2)Co(p1, P23; Mq, Mg, Mq) 
q q 
2s 
= (Bo(pı2s; Mq, Mq) z Bo(p23; Mq, ma) ` (4.115) 
SH — $23 


This function depends on the bubble and triangle scalar integrals defined by 


4-D 
H D 1 
B ; = — p [(1— dl 
A a | 8 aaa mE 
1 
Co(p1, p23 M1, M2, M3) = a} (4.116) 


1 
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The corresponding matrix element squared is 


3 2 2 2 
Is IW S12 + $73 2 
M BS a SE ON Gp y ; , 4.117 
|Mgq—qH| ae = ) PF 5a (oH = soa)? (s23, sH )| ( ) 


For process (b) the amplitude is 


gw gt 


2 
mw 3272 SH [ABC €a(p1)€a(p2)ey(p3) 


Mgg>gH = 


F3°" (p1, p2, ps) A3(p1,P2, pa) + Fe?" (pi, p2, ps) Ao(P1, P2, D3) 
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+F 2" (p2, p3, p1) A2(p2, p3, p1) + T] , (4.118) 


where the colour labels of the gluons are denoted by A, B, and C. This equation 
introduces the projectors F; and F> that are defined by 


B Boa y y 
g“ pip p p 
Fe" (p1, p2, ps) = ( 2 )( 2 i ) 


Pı ` P2 pı: p? P2: P3 Pı ` P3 


a,b OY — pabr aß i Bi 
FSP Pe DES P3 Pı P2 — P2 P3 P1 g Pi P2 (4.119) 
2 Pp P DP. 
Pı : P2 P1 : P3 P2: P3 Pi: P2 \ P3°P1 P3 : P2 


g f o Beg. Ps Pi 
P2°P3\P1°P2 Pı: Ps Pı: P3 \ P2:P3 P2'Pı 
The functions Az and A3 contain the loop integral functions and it is convenient to 
rewrite A3 in terms of a further function, A4, as follows 


1 
A3 (pı, p2, p3) = z [A2(p1, P2, P3) + A2(p2, P3, p1) + A2(P3, p1, p2) — A4 (p1, P2, P3) - 
(4.120) 
The functions A and Ay are then defined by 


A2(p1, P2, P3) = b2 (512, $13, $23) + b2 (812, $23, $13), 
A4(p1, p2, p3) = ba(s12, $13, 823) + b4 (813, $23, $12) + b2(S23, $12,813) (4.121) 


where for a single quark loop the functions bz and b4 read 


netaa 4 | eN & r) (Wo(s) — W2(s17) + Walsstrussn)) 


SH 3 SH 4 
mrs(u—s) 2ut(u+ 2s) 
HRA st, [ s+u T (s +u)? (a= Walaa) 
+(m2— =) (3020) + T Walser) — Wa(t) + W3(s,t, u, sn) 
+s° (te Xs - 3) (W2(t) — W2(su)) + < (W2(sH) — 2W2(t)) 
+ (s 12m? w Wolt, s,u, s1) (4.122) 


The remaining functions, W1, W2 and W3 are remnants of the scalar integrals that 
enter the calculation. Their definitions are as follows: 


1 r l 
Wı(s) =2+ f dz log (1 Te x(1— x) ic 
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Wo(s) = 2 | * log (1 =. «(1 —2) ie (4.123) 
0 T mg 
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and explicit results in all kinematic regions of interest are given in the appendix of 
Ref. [500]. The corresponding matrix element squared is 


2 6 N2 

g Is 8 CCF | 2 2 
M 2 Ws A +|A 
|Mgg—91| DBC eases |A2(p1,p2,p3)|" + |A2(p2, p3, p1)| 


+ |A2(p3,; p1, p2)" + |44 (p1, p2, p3)|7]. (4.124) 


These matrix elements can be used to make a comparison of the Higgs+jet cross- 
section in the full theory with the effective theory approximation. Such a comparison is 
shown in Fig. 4.61, where the cross-sections have been computed at y/s = 13 TeV and 
for my = 125 GeV. The ratio R, defined analogously to Eq. (4.106), demonstrates the 
anticipated behaviour. The cross-section at low jet p, is larger by the same factor as for 
the inclusive Higgs cross-section, cf. Fig. 4.61. The difference between the calculations 
actually decreases as p; increases, until the result of the full calculation is smaller 
than in the effective theory. Nevertheless, the approximation remains reasonable for 
pı(jet) S 200 GeV. At higher jet momenta the approximation quickly breaks down, 
resulting in a very poor description in the effective theory. 

The rate for producing a Higgs boson in association with an additional jet is signif- 
icant at the LHC, due to the nature of the gluon-fusion process. It is more important 
than, for instance, the similar Drell-Yan process due to the fact that the gluons natu- 
rally radiate more copiously than quarks. Indeed, the cross-section for producing more 
than one jet is still rather high. This is indicated in Fig. 4.62, which shows cross- 
sections for the production of a Higgs boson in association with up to three jets as a 
function of the machine operating energy. At LHC energies, with typical jet cuts, the 
cross-section for producing H+jet is about half the inclusive Higgs production rate, 
and the rate for two or more jets is only smaller by about a further factor of three. 
As ys increases, the jet cross-sections grow relatively more rapidly if the jet definition 
remains the same. Thus the study of Higgs boson processes that contain additional 
jets becomes even more of a concern for any future hadron collider. 

The first measurements of differential jet production in association with a Higgs 
boson at the LHC will be discussed in Section 9.6.5. 


4.8.6 Higgs boson production in weak boson fusion 


As already mentioned, weak boson fusion is an especially valuable production mode 
for hadron colliders. Although the cross-section is an order of magnitude smaller than 
for gluon fusion, it provides a valuable probe of the coupling of the Higgs boson to 
W and Z bosons, independent of the manner in which the Higgs boson decays [902]. 
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Fig. 4.61 The ratio, R of the LO cross-section for Higgs+jet production 
in the full theory to the result in the effective theory, as a function of the 
jet transverse momentum, for my = 125 GeV. 


On the theoretical side, the weak nature of the process means that the cross-section 
is under good control. The NLO corrections are small and the scale uncertainty at 
this order is around 10% [171, 228]. To perform a computation beyond NLO it is 
useful to work in a structure function approach, where the process is described by 
the independent production of two W or Z bosons by each initial parton. The W 
or Z bosons then subsequently interact to produce the Higgs boson, as indicated 
schematically in Fig. 4.63. 

This double deep-inelastic scattering approach is an excellent approximation be- 
cause of the fact that interference effects between the quark lines are very small. Using 
this framework it has been possible to compute the NNLO corrections to the WBF pro- 
cess [263, 264], including also the calculation of fully differential observables [301]. Most 
recently the calculation has been extended to N3LO for the total cross-section [488]. 
Fig. 4.64 shows the scale dependence of the cross-section through NLO, computed us- 
ing the calculation presented in Ref. [488], as a function of the pp collider energy (vs). 
This indicates that the scale uncertainty in the NLO prediction for the weak boson 
fusion cross-section at the LHC is at the level of a few per mille. The electroweak cor- 
rections to this process have also been computed at NLO and included in the HAWK 
code [398]. The corrections are of the same size as the NLO QCD contributions and 
must therefore be taken into account. 

Isolating a clean sample of events in which the Higgs boson is produced through 
weak boson fusion is complicated not only by the presence of the usual SM back- 
grounds but also by production of the same final state through other Higgs processes. 
For instance, the Higgs boson may be accompanied by two jets through associated 
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Fig. 4.62 Cross-sections for Higgs production (my = 125 GeV) through 
gluon fusion at proton—proton colliders as a function of centre-of-mass 
operating energy, ys. Cross-sections for production of a Higgs boson in 
association with jets are computed for jets satisfying pr > 40 GeV, |y| < 5 
and kr-clustering with D = 0.4. 


production in which the W or Z boson decays hadronically. However, the biggest such 
contamination occurs from the gluon fusion process, with the radiation of additional 
jets from the initial state. Enhancing the efficiency of selecting weak boson fusion 
events by applying a large rapidity separation between two of the jets in the event still 
leaves a substantial contribution from gluon fusion events, as shown in Fig. 4.65. This 
presents an additional complication when trying to make a precision measurement of 
the couplings of the Higgs boson in this channel. 

An interesting aspect of the weak boson fusion process is its ability to probe the 
tensor structure of HW*W~ and HZZ couplings [811]. The most general possible 
structure of the HVV vertex (where V = W= or V = Z) consistent with gauge 
invariance can be written as, 


Chyv (P1, p2) = a1(p1, p2)g"” + a2(p1, p2) (pi + p2 g” — pips) 
+a3(p1, pa)?" p1, pP2,0- (4.125) 
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Fig. 4.63 Schematic picture of the double deep-inelastic scattering ap- 
proach to WBF. 


The SM is realized by a constant value for a1, independent of the vector boson mo- 
menta pı and pə (cf. the Feynman rules for the SM Higgs interactions in Fig. 4.49). 
A non-constant value for a1, or the new tensor couplings represented by az and ag in 
Eq. (4.125), can be realized by loop-induced couplings within a particular model of 
new physics. az represents an additional CP-even coupling, while ag is CP-odd due to 
the presence of the epsilon tensor in the interaction. The gold-plated observable for 
differentiating between these three types of coupling is the azimuthal angle between 
the two tagging jets, Ad,,;. Fig. 4.66 illustrates the expected distribution of this angle, 
for the SM case and for pure CP-even or CP-odd contributions represented by anoma- 
lous couplings ag or a3. The SM expectation is for a relatively flat distribution, while 
the anomalous CP-even and CP-odd couplings lead to pronounced dips at A¢d;; = 90° 
and Ad¢;; = 0, 180° respectively. This behaviour can be understood from the structure 
of the interactions and resulting matrix elements [811]. For example, the presence of 
the epsilon tensor in the CP-odd interaction means that it vanishes when there are 
fewer than four independent momenta in the process, that is, when the tagging jets are 
collinear. The picture is more complicated in the presence of an admixture of SM and 
anomalous CP-odd and CP-even effects, but the observable Ag;; remains an excellent 
probe. 

Note that a similar analysis can be made for Higgs boson events produced through 
gluon fusion, where the SM Hgg effective coupling takes exactly the form of the ag 
term in Eq. (4.125). As a result, the SM expectation for the Ad¢,,; distribution in gluon 
fusion events has a shape similar to the CP-even (a2) curve in Fig. 4.66. 


4.8.7 Higgs boson production in association with heavy particles 


The associated production channels comprise two distinct cases. In the first, the Higgs 
boson is produced through its coupling to W and Z bosons, through the first two 
diagrams shown in Fig. 4.50. Although the cross-sections for these production modes 
are smaller than for weak boson fusion, they do offer the possibility of providing 
additional information on the couplings to W and Z bosons separately. From the 
form of these diagrams it is clear that, theoretically, these processes are closely related 
to the Drell-Yan process: for the most part, they can be described by the off-shell 
reaction, pp + V*, followed by a subsequent decay, V* — VH. This similarity has 
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Fig. 4.64 The scale uncertainty in the weak boson fusion cross-section, 
at each order through N*LO. The cross-sections are shown at each order 
as a function of the collider energy and are normalized to the NLO result. 
Reprinted with permission from Ref. [488]. 


enabled theoretical predictions for the cross-sections to be made at NNLO in the strong 
coupling [524]. Moreover, the NLO electroweak corrections are also known [458]. 
This channel is particularly interesting as a way to get a handle on the bottom 
quark decay mode, H — bb. A standard analysis of this mode would be plagued 
by large backgrounds, in particular from top pair production that yields events with 
similar kinematic properties. However, in Ref. [297] it was shown that sensitivity could 
be recovered in these channels by looking in the boosted regime where the vector bosons 
are produced back to back and at large transverse momenta. Although this significantly 
reduces the signal cross-section, the dominant backgrounds are impacted even more 
strongly. However, the key to utilizing the boosted kinematics fully comes from the 
properties of the jets that should be produced in the signal process: the bb pair should 
be reconstructed in a fat jet, as discussed in Section 2.1.6. In order to provide the 
best discrimination against background processes it is also necessary to enforce a veto 
against any additional jet activity. The application of such a requirement greatly affects 
the size of the QCD corrections in this case. For the inclusive VH cross-section the 
effect of higher-order corrections is rather mild, but after applying a jet veto this is no 
longer true. The situation in the presence of the jet veto is indicated in Fig. 4.67, which 
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Fig. 4.65 The cross-section for Higgs production through gluon fusion 
and weak boson fusion, as a function of the rapidity separation between 
the two jets. Jets are defined using the anti-kr algorithm with D = 0.4 
and satisfy pr > 40 GeV. 


shows the theoretical prediction for the pı distribution of the fat jet in WH events at 
LO, NLO, and NNLO. There is a significant negative correction at NLO and a smaller 
further reduction at NNLO. The fact that the first-order correction is so large indicates 
that further work is required to obtain a reliable prediction for the cross-section in 
this region. Since, at this point, an even higher-order calculation is infeasible, a better 
avenue for improving the prediction is through the use of resummation techniques of 
the type that will be discussed in Chapter 5. 

The second associated production channel results in a final state containing a 
Higgs boson together with top quarks. A top quark can be produced through one of 
the normal strong processes, with the Higgs boson coupling to it through the Yukawa 
interaction. The largest such cross-section is ttH production (see Fig. 4.51), taking 
advantage of the large top quark mass to enhance the Higgs coupling. Even so, at 
LHC operating energies, this cross-section is the smallest of the SM production modes 
shown in Fig. 4.53. As a result the LHC has only limited sensitivity in this channel 
until the accumulation of bigger datasets in Run II and beyond. Nevertheless, such 
data offer the chance of providing direct evidence of the coupling of the Higgs boson 
to the top quark. It could also yield information on the coupling to bottom quarks 
through a striking signature containing four identified b jets, two originating from the 
top quark decays and the remainder from a H — bb decay. Since the lowest-order 
process is much more complicated than in the other channels, predictions for the ttH 
cross-section and related observables are only available at the NLO in QCD [209, 440]. 
The closely related process, where the Higgs boson is produced in association with 
bottom quarks, is too small to be probed at the LHC in the SM. 
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Fig. 4.66 The azimuthal angle between WBF jets for SM Higgs produc- 
tion (magenta), and anomalous production through pure CP-even (green) 
and CP-odd (blue) couplings. Reprinted with permission from Ref. [613]. 


It may also be possible to achieve additional sensitivity through the t-channel single 
top process shown in Fig. 4.68. However, this channel suffers from larger experimental 
backgrounds and the sensitivity to the Htt coupling is smaller because its effect is 
washed out by the coupling of the Higgs boson to the t-channel W boson in this 
process. 


4.8.8 Vector boson scattering 


A class of processes that will constitute a key part of the future LHC programme is 
termed vector boson scattering. A vector boson scattering (VBS) process is so- 
called because, at its core, it should probe an amplitude, 


Vi + V2 Vg + Va, (4.126) 


where Vi,..., V4 are a suitable combination of Z and W~ bosons. Soon after the devel- 
opment of the Higgs theory, such processes were identified as keys to demonstrating 
the validity of the known SM. The reason is that, at high energies, the behaviour 
of vector boson scattering amplitudes is very sensitive to the gauge structure of the 
electroweak sector [723]. 

Consider the case of W boson scattering, Vi = V3 = Wt, Vo = Va = W7 in 
Eq. (4.126). There are three prototype diagrams for this process, as shown in Fig. 4.69, 
corresponding to s- and t-channel exchange of an intermediate boson and a single 
diagram involving the quartic coupling. In the high-energy limit of this scattering 
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Fig. 4.67 The transverse momentum distribution of the fat bb jet in 
WH(— bb) events at LO, NLO, and NNLO, from the calculation of 
Ref. [524]. Reproduced with permission from Ref. [310]. 


b 


Fig. 4.68 Lowest-order Feynman diagrams for the production of a Higgs 
boson in association with a single top quark. 


process, the amplitude represented by these diagrams is dominated by the longitudinal 
polarizations of the vector bosons that grow with the energy, E. Naively one thus 
expects the amplitude to grow as E*. However, the gauge structure of the SM ensures 
that the leading behaviour of the quartic-coupling diagram is exactly cancelled by 
contributions from the exchange of intermediate Z bosons and photons. As a result any 
anomalous quartic coupling, resulting from simply rescaling the SM quartic coupling 
strength, would greatly affect the size of this scattering cross-section. 

The final component of the puzzle, that prevents the amplitude scaling at the 
subleading Æ? level, is provided by the Higgs boson. Inclusion of the s- and t-channel 
Higgs exchange diagrams cancels the remaining Æ? dependence, rendering the SM 
amplitude well behaved at high energies. Even though the amplitude does not diverge, 
the perturbative expansion may still not respect unitarity. A partial wave analysis of 
all vector boson scattering channels [723] leads to a powerful constraint on the Higgs 
boson mass: 
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(a) (b) (c) 
Fig. 4.69 Prototype Feynman diagrams representing the scattering pro- 
cess WWT — WtW . The exchanged particle in diagrams (a) and (b) 
may be a photon, Z or H boson. 


1 
2 
mH < (3) ~ 1 TeV. (4.127) 


Indeed, this bound was a powerful argument that a Higgs boson below this mass, or 
something playing that role, should be observed at the LHC. Even with the discovery of 
a light Higgs boson, it is still possible that this particle alone is not wholly responsible 
for unitarizing the high-energy behaviour of the amplitudes. Probing vector boson 
scattering is therefore an essential closure test of the SM. 

At the LHC it is possible to probe vector boson scattering through the O(a*) elec- 
troweak processes pp > V,V2j77. Representative diagrams are obtained by attaching 
quark lines to two of the vector bosons in Fig. 4.69. However, such diagrams only 
represent a small fraction of all the possible diagrams one may draw at this order in 
perturbation theory. The remaining diagrams represent, for instance, simple emission 
of vector bosons from quark lines. All of the vector boson scattering processes have 
been computed at NLO in QCD and implemented in the VBFNLO program [187]. 

A search for VBS is challenging for a number of reasons. First, the weak nature of 
these processes means that cross-sections are naturally small. Second, the VBS compo- 
nent must be isolated from the relatively uninteresting other production mechanisms. 
Finally, due to the inherent cancellation mechanism at high energies, the sought-after 
signal is even smaller than might otherwise be expected. Cross-sections for various 
VBS processes at 13 and 100 TeV are shown in Table 4.3. Since the cross-sections are 
so small, and the important aspect to probe is the contribution at high-energies, the 
study of these processes is much more fruitful at 100 TeV. Note that the t-channel 
exchange diagrams mean that there is a significant rate for same-sign W-boson produc- 
tion. This is particularly interesting since the same-sign final states have a considerable 
benefit in that they suffer from far fewer backgrounds than the other channels. 


4.9 Summary 


This chapter has discussed the application of fixed-order QCD perturbation theory to 
a variety of hadron collider processes. While the technical achievements in this arena 
are impressive, from NLO computations of 2 — 6 processes to calculations at NNLO 
and beyond, this survey has also revealed a number of shortcomings of this approach. 
A central theme of the breakdown of the fixed-order description is its application to 
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Table 4.3 Cross-sections for vector boson scattering processes at the LHC and 
a 100 TeV proton-proton collider. Jets are defined by the anti-kr (D = 0.4) 
algorithm and satisfy pr > 20 GeV, |7| < 5 and m,;; > 100 GeV. 


Final state Nominal process (13 TeV) [fb] (100 TeV) [fb] 
CVV yy II W-Wr9j 11.2 316 
Vee Vy pe" IJ WrWwrgj 3.35 114 
eC Del Dy jj W-W- jj 1.21 69.6 
Veet uT ujj W253 1.44 39.3 
e Det ptjj WoZjj 0.89 31.8 
eet ujj ZZjj 0.29 10.5 


final states where the kinematics are particularly restricted. Examples highlighted were 
threshold production of photon pairs (Section 4.2.3) and top quarks (Section 4.5), as 
well as the effects of a jet veto in Higgs production by gluon fusion (Section 4.8.4) and 
in the associated VH mode (Section 4.8.7). 

In these cases the fixed-order approach results in perturbative predictions that do 
not sufficiently converge at sucessive orders of calculation, or that exhibit unphysical 
behaviour. In each case the origin of these symptoms is a large logarithm, present at 
each order of the calculation, which spoils the expected perturbative behaviour. As 
explained in detail in the following chapter, these logarithms can be systematically 
identified and analytically resummed to restore the predictive power of QCD in these 
situations. Moreover, Chapter 5 will also show how similar ideas may be used to 
perform resummations numerically in order to provide parton shower predictions that 
offer an extremely wide range of applicability. Ameliorating the fixed-order description 
discussed in this chapter with such methods will be crucial to achieving the level of 
understanding necessary to confront the theory of QCD with experimental data. Such 
comparisons will be presented in some detail in Chapters 8 and 9. 


5 
QCD to All Orders 


As already discussed in Section 2.3, often kinematic situations occur which are 
characterized by very different scales u;i. Then logarithms of the type log(~o/f41) may 
become so large that they overcome the smallness of the couplings, alog(pi9/~1) ~ 1. 
In such cases, truncating the perturbative expansion at any fixed order will not yield 
correct results, and rather than attempting to include, order by order, perturbative cor- 
rections, it is often more important to resum such dangerous logarithmically enhanced 
terms to all orders. In this way, the programme of resummation of large logarithms 
complements the fixed-order efforts described in previous chapters. In fact, there are, 
broadly speaking, two ways to try to resum large logarithms, namely either through 
analytic methods, which were introduced in Section 2.3, or numerically, by means of a 
parton shower. In both cases the name of the game is to further push the accuracy by 
combining the knowledge of both fixed-order results and of the logarithmic structures 
in one best theoretical prediction. 

In order to develop an intuitive understanding of a number of QCD phenomena 
further, the chapter starts with revisiting the QCD radiation pattern in Section 5.1. 
There, some care is taken to better quantify some ideas concerning the interface of 
perturbative QCD and emissions described by it and of the non-perturbative phase 
governed by hadrons and hadronization. Some first phenomena, such as angular or- 
dering or the QCD hump-backed plateau, will be elucidated based on these rather 
quantitative considerations. 

In Section 5.2 the discussion of analytic resummation with the example of the 
pı spectrum of the W boson already sketched in Section 2.3 will be extended to 
higher accuracy and, in addition, also other processes will be considered. To round 
this section off, other analytic resummation methods will be introduced, which either 
aim at different kinematical situations or are based on a different formalism. 

Section 5.3 introduces the numerical implementation of resummation in the proba- 
bilistic parton shower picture. It will connect the parton shower to the analytic resum- 
mation discussed before. In addition, various methods to improve the formal accuracy 
of event simulation through the parton shower by including higher-order exact matrix 
elements will be summarized in Sections 5.4 and 5.5. They have been at the centre 
of formal developments in the framework of simulation tools and continue to play a 
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central role in the quest for an ever-improved precision in the analysis of LHC data. 


5.1 The QCD radiation pattern and some implications 


This section examines some general features of the QCD radiation pattern. It follows 
the insightful discussion in various parts of the excellent book by Dokshitzer et al. [477]. 


5.1.1 The QCD radiation pattern revisited 


5.1.1.1 Reminder: Characteristic scales 


Coming back to Eq. (2.34), the differential probability for the emission of a gluon with 
energy w and transverse momentum k, off a quark with energy E is given by 


dw = alki) Cr +. “ [1 + (1 < =) (5.1) 


This yields a relatively broad spectrum both in transverse momentum and energy of 
emitted gluons, peaking at small transverse momenta and small energies. As already 
discussed in Section 2.1, the spectrum is cut off at small transverse momenta and 
energies by the onset of hadronization. This process takes place at distances of typical 
hadron radii of R ~ 1fm or at masses of the order of at most a few Agcp. Broadly 
speaking, in a hard scattering process characterized by the hard scale Q, two classes 
of secondary emissions present themselves. First, there are emissions, where 


1 
R< ki www R — wt? y a, (ki) < 1, (5.2) 


signalling the production of jets. Second, there are emissions, where 


1 
e e wi ~ a(k )logki ~ 1, (5.3) 
associated with inner- and intra-jet radiation. 

Taking a closer look, different orderings in these emission patterns can be further 
distinguished, which lead to different physical phenomena, namely 


e double-logarithmic enhanced emissions, 


1 

pIo, (5.4) 
constituting the bulk of the emissions and producing the Bremsstrahlung pattern 
in inner-jet emissions; 


e hard collinear emissions, 
1 
5 Ski Kw~Q, (5.5) 
R 
which are responsible for scaling violations in DIS and similar; 


e soft, typically wide-angle, emissions, 
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1 

Ss ISk ww KQ, (5.6) 
R 

which are responsible for emissions in the phase space between jets, and which 

lead to the observable drag effect (see Section 5.1.2.2). 


In the following, these different kinematic regions will be investigated in more detail, 
pointing out their visible consequences. 


5.1.1.2 Time scales and the link to hadronization 


In Section 2.1.4 typical time scales defining the hadronization process have been dis- 
cussed, in particular the formation time ¢f'™ of a parton and its hadronization time, 
ttad, Following Eq. (2.39) and Eq. (2.38) they are given by 


| k 
form — Sl and P = kR. (5.7) 


Demanding that partons are formed before they hadronize (or, in more martial terms, 
that they are born before they die) automatically implies kų > 1/R. Extending this 
reasoning to study the dynamics of the phase transition from partons to hadrons in 
more detail, consider partons living at the edge, i.e., those partons for which ky ~ 
1/R, aptly dubbed “gluers” by the authors of [477]. Such gluers are first formed at 
times R, with momenta given by kj ~ kų ~ w. As time increases, more and more of 
such gluer-partons are being formed. 

Assuming that the spectrum of hadrons closely follows that of the partons, a con- 
cept known as local parton-hadron duality (LPHD) [177], and assuming the 
absence of parton emissions below the cut-off R allows the identification of final-state 
hadron energies € with the energies of gluers w. Keeping only the dominant logarithmic 
enhanced terms when integrating over the soft emissions encoded in Eq. (5.1) yields 
the approximate hadron energy spectrum, namely 


Q 
dk? Cras(k? wj] dw 
AN (hadrons) ~ J a = sl a [1 (1 )I Ww oo 
k1 >1/R = 
s(1/R? 
„~ Cras(l/R*) log(Q?R?) dlog w(5.9) 
T 


log(Q? r?) Ž = CaN R’ 


Replacing w — e means that the distribution of hadron energies approximately fol- 
lows the form 
dNinadrons)/dloge = const. , (5.10) 


a plateau in the logarithm of their energy. Therefore, the energy distribution of the 
hadrons peaks at an energy €min that can be related to gene ~ 1/R ~ mnaa, the 
typical hadron length and mass scales. 

It is interesting to study how additional hard radiation changes this naive picture. 
To gain some insight, consider the case of a single secondary parton emitted under 
an angle @ and introduce the separation time t*P), after which it reaches a distance 


The QCD radiation pattern and some implications 273 


R from the original parton. For such secondary partons, a hierarchy of time scales 
emerges, namely 


k 
fiorm 5 MI 
ki 
tP v RO Ww form (Rk) 
d o kR? ~ t™ (Rki)? (5.11) 


For gluers, as Rk, ~ 1, these scales are all identical, but they differ for “proper” gluon 
emissions. 

The natural question now is, how such hard secondary partons enter the hadroniza- 
tion business? The quick answer is fairly straightforward: further gluers are formed, 
following the secondary parton. This can however be further quantified. 

From 6ltet) ~ @ and pisiuer) ~ 1/R one finds we") ~ 1/(RO) and therefore 
characteristic times R/@. In other words, at a time t©°?) the secondary parton starts 
decoupling from the other colour sources through the emission of gluers. They in turn 
also become hadrons with characteristic energies of about w'@'"*"), which are a factor 
1/0 larger than those stemming from the original, primary parton. The secondary 
parton therefore starts looking like a jet, produced at its own hardness scale Q ~ kı, 
but with energies boosted by 1/6. It therefore does not contribute to the yield of the 
softest hadrons. The explanation for this is that for hadron energies in the interval 


1/R Š wonadron) © 1/(R8) (5.12) 


the primary and secondary parton are not yet separated enough to start emitting 
gluers independently — the soft colour field manifesting itself in the gluers just “sees” 
the combined colour charge of both. This is the most striking manifestations of colour 
coherence in the QCD emission pattern. 


5.1.1.3 Quantum coherence in the emission pattern 


For further insight into this quantum mechanical effect consider its analogue in QED, 
where it is known as Chudakov effect [397]. Recently [882] this effect has been 
experimentally confirmed. It occurs in the emission of a secondary photon by an 
electron—positron pair produced in the induced splitting of a primary photon, sketched 
in Fig. 5.1. 

Assume a Lorentz-frame in which the electron and positron carry about the same 
amount, about half, of the primary photons energy pj and where the opening angle 
Oee of the pair is small. In such a frame, the relative transverse momentum of the 
e et-pair, p], satisfies 

sin bee X bee X “= (5.13) 


and the transverse component of the pair’s wave vector is given by A}, = 1/p1. The 
secondary photon must be formed in a time given by the off-shellness of its emitter, 
here the positron. With a splitting variable z such that the longitudinal component of 
the photon momentum is ky = zp, the positrons virtual mass is about 
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(1— z)p + P1/2 


2D +p, /2 


Fig. 5.1 The Chudakov effect: a primary photon with momentum pi splits 
into an electron—positron pair with momenta po + pt), which in turn 
emits a secondary photon with momentum k. The fermion momenta before 
photon emission are given by p*t? = zp tp. /2 and p= (1—z)p +P. /2, 
respectively, and pı denotes their relative transverse momentum. 


ki i (pH)? sin? bey ~ (zp) 0y (5.14) 


leading to an energy imbalance of 


k2 
AE x — x zp02 5.15 
zP) [Per (5.15) 
and therefore a formation time of the secondary photon of 
1 1 
At x —— 7x ; 5.16 
AE zp ( 
During any time At the pair separates in space by Ab 
Ab = bee At = P+ At, (5.17) 
PI 
applied to the formation of the secondary photon the ete~-pair separation is 
0 
Ab, 5.18 
xp\|Gey i 


In order for the photon to resolve the positron as individual point-like charge rather 
than just being emitted by the dipole with its zero net charge, the transverse wave- 


length of the photon 


1 1 
‘~~ . 
AE kı zp Oey ee) 
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must be smaller than the separation of the pair, 


0 1 
Ab x —— > ar, (5.20) 
2pyBe, ~ 2D\Bey = 
which is satisfied only if 
bee > Oye- (5.21) 


This qualitative picture of quantum coherence in the emission pattern above, lead- 
ing to angular ordering, can be supplemented with a proper calculation, which captures 
all of its essential features. The relevant part of the matrix element for the process 
y* + ete ~7¥ is given by the eikonal, 


fp H 
', _ oi» f Po P 
wor Bk =, (FS £) 


cf. Eq. (2.14). Squaring it yields the radiation function. For massless particles the 
velocity equals the speed of light, i.e., both |v], |] — 1. The classical analogy 
condensed in Eq. (2.12) is recovered, and expressed in angles the radiation function in 


both the quantum mechanical and classical case reads 


2(1— ñn ñ) B 2(1 — cos 6.+¢-) 


W it = — . 
ae (1 — n) (1 — ñn) (1 — cos 04+) (cos 0ye- ) 


(5.22) 


Here ři is the direction of the photon, and ñ+ are the direction of the two leptons. 
Following [504], this expression can be decomposed into two parts, related to the 
emission of the photon by either the electron or the positron, 


Wee = WO +W, (5.23) 


e- 


where 1 1 
wh = Wee 4 5.24 
er’ 3 1 — cosĝye+ 1 -— cos 0 eF ( ) 


This decomposition will be encountered again, in later parts of this section. The indi- 
vidual emission functions need to be integrated over the full angular region of photon 
emission. Concentrating on wh, the angular integral of the photon direction with 
respect to the direction of the positron is given by 


d’, = dcosO,e+ddbyer - (5.25) 


To solve this integral, in a first step, the term (1 — cos 04e- ) is expressed as a function 
of d,-+ such that 


1 — cos bze- = (1 — cos e+e- COS Gas) — (sin Oe+e- sin Gt) COS Pet (5.26) 
=a — bcosdyet . 
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The ¢ = ¢ye+-integration is simplified by introducing the complex variable z = et? 
and by restricting the integration over z to the unit circle, |z| = 1. Therefore, 


dz 
re ies 


dé = (5.27) 
Zz 
and * 
cos@ = ia - 4 (5.28) 
Putting it all together yields 
27 
fires if ddbyet 1 _i dz 
2m 1-— cosĝ e- 2T z (a — b 4) 
2 lat (5.29) 
A f dz -~d dz 
in bz? — 2az+b irb (z= z4)(z = z)’ 
|z|=1 |jz|=1 
where the poles are given by 
a a? 
Z4 = 5 a RB = 1. (5.30) 


For relatively small angles 0,.+ < e+e- only the pole at z_ resides inside the unit 
circle, and 


1 1 
T= = : 5.31 
a2 — b2 | COS Aye+ — cos e+e- | ( ) 
Combining the 1/(1 — cos 04e- ) term with the W.+-- in Eq. (5.24) leads to 
2m 
| ddbyet wh. = 1 _ COS Oaet — COS bete- 
Qqr Da 1 — cos ĝye+ ' | cos Oye+ — COSOe+¢-| 
0 
0 ee ee (9:82) 
= 2 
else. 


1 — cos Oy¢+ 


This proves the angular ordering property for each of the two individual radiation 
functions for the QED case discussed here, and thus also for the combined overall 
QED radiation pattern. 

In QCD a similar effect can be observed. Consider a colour charge, like a quark, 
emitting a first gluon under an angle O. Then, a second gluon emitted under an angle 
0 would resolve the individual colour charges of the quark and the primary gluon, 
only if 0 < ©. If, conversely, 6 > ©, then this secondary gluon would only feel the 
combined colour charge of the quark and the primary gluon, i.e., the colour charge 
of the quark only. Colour coherence therefore results in an angular ordering in the 
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emission pattern in final state radiation. 


5.1.1.4 First consequences: Rapidity distributions in hadron collisions 


The same reasoning, so far only employed for final-state radiation, of course also applies 
to initial state radiation: consider the scattering ab — cd of two partons a and b in 
the initial state resulting in two partons c and d in the final state. There are two 
interesting cases, namely the scattering process being driven by the exchange of either 
colourless or of colourful particles in the t-channel. 

Consider first the case of a colour-singlet particle being exchanged in the t-channel, 
for example by a photon, such that the two colour flows connecting particles a and c 
and particles b and d, respectively, decouple. The scattering angle Oac in the centre- 
of-mass system of the colliding partons will act as a discriminator for soft emissions. 
It turns out that emissions of a soft parton k off either the incident parton a or the 
outgoing parton c under angles Oar, Oek > Oac will be highly suppressed. 

To understand this remember that the formation time for a gluon with transverse 
wavelength A, emitted under an angle 0 from either parton a or parton c can be 
estimated as 

gm) as KiB (5.33) 


The transverse displacement of c with respect to the original direction of a during this 
time is given by 
pi = thre... (5.34) 


Again, it must be larger than the transverse wavelength of the emitted gluon in order 
for it to resolve the individual colour charges. For longer wavelengths any potential 
emission would be off a colour line passing through the process without any deflection, 
and therefore there is no associated Bremsstrahlung. This ultimately yields the angular 


ordering constraint 
Dak » Dok < Dae + (5.35) 


It implies that soft radiation off particles a and c — and similarly for emissions of 
partons b and d — is confined to a single cone of about twice the size of Oac. The scale 
characterizing the interaction is given by the Mandelstam variable f. Correspondingly, 
the transverse momentum of the outgoing partons c and d can be estimated as 


pi œx V-E. (5.36) 


Then, the momenta of the Bremsstrahlung gluons emitted in the process are con- 
strained by ky < pı. The amount of the resulting soft radiation is, roughly speaking, 
given by the sum of the colour charges emitting into the respective combined cones, 
i.e., the sum of the colour charges Ca + Ce = 2C, for the forward and Cy, + Ca = 2Cy 
for the backward cone. 

The way angular ordering dominates the emission pattern of soft partons in such 
singlet-exchange processes can be summarized as follows Taking into account their 
respective orientation, incoming and outgoing particles form coloured dipoles defined 
by corresponding opening angles 0. Soft emissions off the individual particles forming 
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Fig. 5.2 Sketch of the rapidity distributions of soft particles emitted in 
parton-parton scattering processes, mediated by the exchange of a colour-s- 
inglet or a colourful particle in the t-channel. In the latter case, in the 
high-energy limit, this particle typically is a gluon. The distributions are 
in the c.m.-frame of the incoming partons, and the outgoing partons in 


each case are fixed at rapidities of y = +2.5. 


these dipoles are confined to cones with an opening angle of 0 around the particle 
direction. 

Consider now the other case of the reaction ab — cd being transmitted by the 
t-channel exchange of a coloured particle. In this case the reasoning above does not 
apply, since the colour flows are not decoupled any more. Instead, now the region of 
emission angles Oar, Ock > Oac will be filled by emissions off the t-channel particle 
and is therefore susceptible to its colour charge. Since, typically, at large energies 
and relatively small scattering angles, i.e., in the region 8/f >> 1 gluon t-channel 
exchange dominates QCD scattering processes, the colour charge usually is C4. This 
is a universal property of QCD scattering at large energies and small angles. The 
additional soft radiation into central rapidity regions is independent of the scattering 
partons. It can also be shown that the soft particle distributions in the region between 
the two forward cones are fairly independent of the rapidity and more or less constant. 
This is quantified in Section 5.2.5. 

The findings are summarized in Fig. 5.2, exhibiting sketches of the rapidity dis- 
tribution of particles being emitted in the two cases discussed. The picture of such 
processes of course changes when further, hard emissions populate the central rapid- 
ity region. As this is a relatively rare process, suppressed by one factor of the strong 
coupling without any sizeable logarithmic enhancement, however, the overall picture 
for the bulk of the events is relatively well described by the reasoning above. 

A striking and highly relevant example for the impact of this aspect of the QCD 
radiation pattern in hadron collisions is the production of some potentially heavy sys- 
tem, such as, e.g., a Higgs boson, in the fusion of weakly interacting bosons (WBF). 
In this case the two outgoing partons c and d form forward tagging jets. These jets 
typically will have sizeable energies and relatively small scattering angles, as their 
characteristic transverse momentum scale will be given by the mass of the produced 
boson. A crucial part of the signal for such a process, allowing a very effective suppres- 
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sion of QCD backgrounds, relies on the fact that soft emissions under large angles are 
effectively absent due to angular ordering. As a result, in WBF events, QCD radiation 
in the central region between the two forward, opposite hemisphere tagging jets, is 
depleted [478, 482]. Therefore, vetoing events with central jets will typically result in 
a very effective suppression of the regular QCD background while the signal remains 
nearly completely unaffected [810, 820, 821]. 


5.1.2 Non-trivial consequences 
5.1.2.1 The hump-backed plateau 


The “hump-backed plateau” of the inclusive particle energy spectrum inside a 
jet [178] is one of the non-trivial consequences of the QCD radiation pattern. It is 
due to two competing and opposite effects: on the one hand, the restriction ky > 1/R 
forces subsequent emission angles to increase with decreasing energies w, 


0 ~ kı /ky ~ ki /w > 1/(wR). (5.37) 


On the other hand, angular ordering leads to shrinking allowed emission angles such 
that after a few emissions there is no viable phase space left for further softer emissions. 

To cast this more quantitatively, consider a jet emerging from a massless quark 
travelling with energy E. The gluers emitted by the quark translate into hadrons with 
transverse momentum p, relative to the jet axis and with energy e. These quantities 
are related to the emission angle 0 with respect to the incident quark through 


pı ~ ð ~ I/R. (5.38) 


The plateau in the distribution of hadrons with respect to the logarithm of their energy, 
dN/dloge = const., cf. Eq. (5.10), can be translated into a spectrum with respect to 


the angle, 
fde fab 
N x fof peeve, (5.39) 


where the 6-function encodes the phase space condition defining gluers. 

Consider a case where a hard gluon of energy w is emitted by the quark under 
an angle 09. Assuming that the radiation off the quark and the gluon can naively 
be added, i.e., assuming independent emission of secondaries without any quantum 
coherence, the gluon would contribute gluers with energies € given by 


1/R<e<w, (5.40) 
and their transverse momenta would be given by 
pı ~ e0 >1/R. (5.41) 


If this picture was correct, the number of gluers and therefore hadrons would read 
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= fe] (<0 — 1/R) 
a fE foum- ym fE T Baa- 


The condition on the gluon being hard, or, more precisely, harder than a gluer, is 
encoded in the Heavyside O-function. The maximal emission angle of gluers is given 
by Omax. In the incoherent case outlined above, this angle essentially is unconstrained, 
and for simplicity one would set it to Omax = 1. 

Following previous discussions it is clear that this picture of incoherent addition of 
the gluer spectra is overly simplistic. Encoding angular ordering in the upper limit for 
the angle of the emission of the gluers translates into setting Omax = 0o. Combined 
with the integral over the angle of the hard gluon emission the effect of including 
this upper limit or not, i.e., between adding gluers from the gluon coherently or not, 
corresponds to about a factor of one half: 


[2 2 dé S ifi m dé (5.43) 


This factor exponentiates with additional gluon emissions, thereby leading to a drastic 
reduction in the overall number of gluers emitted in the parton cascade. Ultimately 
it leads to a reduction of the soft part of the hadron spectrum produced in such a 
cascade, a testable result of angular ordering. 

Transforming the integral expression for the total number of hadrons into a spec- 
trum with respect to loge yields 


(5.42) 


aN iri S flog? (ER) — log? (eR)] for incoherent sum, fmax = 1 (5.44) 
—— Z E 
dlog e 1 + a, log — log eR for coherent sum, Omax = 9 
€ 


Instead of the incoherent energy spectrum peaking at energies given by hadron 
masses, 


(€) = (Ehaa) = z ~ ™Mhad 5 (5.45) 


the depletion of the soft part of the coherent spectrum results in a peak of hadron 
energies roughly scaling like (Epaa) ~ WE, the energy of the parton giving rise to 
the jet. In fact, the overall energy spectrum of the particles assumes an approximately 
Gaussian shape in the variables € = —logr, ~ —logaxg. For convenience here the 
scaled momentum or energy of the particles zp = |p|/E, €/E is used. In Fig. 5.3 
the hadron spectrum following from this discussion is sketched, and it is compared to 
data taken at e~ et colliders at various centre-of-mass energies. The figure shows that, 
qualitatively, the actual hadron spectra follow the form given in our relatively rough 
discussion. 
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Fig. 5.3 The hump-backed plateau in hadron spectra: in the left panel 
the impact of coherence is sketched. As discussed in the text, its domi- 
nant effect is the depletion of hadrons in the soft regime, i.e., for large 
values of log(1/F). This effect is visible also in the right panel, where data 
taken in e~ et annihilation from Tasso [283] and OPAL [132] at various 
centre-of-mass energies are displayed. 


5.1.2.2 The drag effect 


As a second non-trivial effect of quantum coherence in soft gluon emission consider 
the radiation off a qq dipole. As in the QED case, it is given by the eikonal term 


2 2 
a as dw d°OR 2pipj _ Os dw dìg 7 4 
ay Cr 2r w An (pik)(pjk) Fm w 4r aa (Ti) asn 
cf. Eq. (2.14) and Eq. (5.22), where 
2(1 — ngiz 
Waali) = - ofa) _ 
OAN Aia) 
foi , adia- ] [1 , u-i) 
1— ng  (1-— nng)(1-— nrg) 1— ng (1—7n,)(1 — tinig) 


(5.47) 


depends on the directions 7%, of the quark, fig of the anti-quark, and 7 of the emitted 
gluon. The two terms W, and Wz are of course nothing but the radiation functions of 
the individual particles constructed in Eq. (5.24). Each of them decomposes into two 
parts: the first terms in the square brackets, 1/(1 — ññg,g) represent the incoherent 
part of the radiation pattern, while the second terms in the square brackets account 
for interference effects and thus reinstall coherence. As has been shown above, after 
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Fig. 5.4 “Mercedes-star” topologies in qqy (left) and qg final states 
(right). They are characterized by relative angles of around 120° between 
the three particles. Due to momentum conservation their directions 71,2,3 
must lie in a plane in the c.m.-system of the collision. It is interesting to 
compare the radiation pattern of secondary gluons emitted in the ñ direc- 
tion midway between the quarks with the counterpart along 7’, between 
the quark and the boson. 


integration over the azimuth angle, this interference part is roughly equal in size to 
the incoherent part outside the cone with opening angle 64g such that 


dọ 2 


= a(n; Tig) ~ 


(W (ñi; ñia)) = 


A O(6qq — 949) - (5.48) 

The drag (or string) effect in the QCD radiation pattern is best studied in the 
emission of an additional (second) gluon with direction 7 from “Mercedes-star” qqy 
and qqg states. These are configurations formed in electron—positron annihilation, 
eet — qq + {y, g}, where the energies of the quark, the anti-quark, and the photon 
or gluon, Eq, Hz, and Ey g, and their relative angles are of roughly the same magnitude, 
i.e., 


E 
3 
“a (5.49) 
3 


see also Fig. 5.4. 
As an extreme example, consider first the radiation of soft quanta in the direction 
7m opposite to the boson (the photon or gluon), but in the plane spanned by the three 
objects — quark, anti-quark, and boson. In case of radiation off the qq@y final state, the 
only relevant effect of the photon emission is the corresponding reduced phase space 
of the quark—anti-quark dipole. The radiation pattern in this case therefore is given 
by 
as dw dN 


dwt? = Cp — — =- Tm 
j Fom w 4r 


Waal ñ), (5.50) 


essentially the radiation of a quark-anti-quark pair at a c.m.-energy reduced by the 
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amount carried away through the photon emission, and boosted back into the lab 
system. This boost thus induces a cone size of gee =~ 27/3 less than ie sm in 
the case of the pair in its own c .m.-system, which is compensated for by the suitably 
increased energies in the lab system. 

For the case of a gluon emitted instead off a photon, consider first a “QED” ana- 
logue, essentially replacing the gluon with a positron pair and the quark and anti- 
quark with an electron. This mimics the fact that a gluon carries about twice the 
colour charge of a quark, which in the limit of large Ne is exactly correct, as 

e- N2 -1 Neo Ne 


= ; 51 
Cr 2N. > 2 (5.5 ) 


In this toy model, the radiation pattern is given by 


dw“ — 2 dk pi PS 9 PS ? 
In (273)2w \ pik pok p3k 


dw d?20 1 Pa 
As AW nm > > 2 
om Seah An Wao + Waali) — a(t) , 


A simple calculation based on the individual W shows the complete absence of soft 
radiation in the direction of ñ: 


ame 2 (1 — cos $) 1 2(1—cos 4) 
(1 — cos 3) (1—cosr) 2 (1 — cos 2") 
z (1-os) 22-58 zii (5.53) 


In a similar way, in the direction 7’, opposite to one of the quarks, soft radiation is 
given by 


tv tt) + Walt) — F Wa] = 3-9. (5.54) 


This is to be compared with the case of the qqy final states, where 


2 (1 — cos 45) 3 


Walii) = a: 
wn ey? (5.55) 
Wa) = 2 (1 — cos ) oe: P i 
an) = (1 — cos 24) (1 — cos) De 


In proper QCD these quantities are of course given by fully including the correct SU(3) 
colour factors [176]. Denoting with 7, yet another direction, orthogonal to the event 
plane, this leads to the following ratios for the emission of a soft secondary gluon in 
the QED and QCD cases 
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dw (R) 1 dw9(f’) — 5N2—-1 22 

dwi (mi) 4? dwn)  2N2-4 7 N 
dwn (R) 1 dwa) = Ne+2Cp 17 

dwin (n) 4’ dwi9(r) ~~ 2(4Cp — Ne) 14 


and to the following ratios between the QED and QCD cases in the direction 7% opposite 
the boson: ae 3 
COED Nz -2 _ T (5.57) 
dwan (ri) 2(N2 — 1) 16 
A number of these findings are remarkable. First of all, in the QCD case the soft 
radiation in the direction opposite to the gluon is massively suppressed by a factor of 
about three (1/N.) with respect to the favoured region between the quarks and the 
gluon, around 7’. Possibly even more noticeable, the destructive interference between 
the three dipoles renders this direction, 7, more unfavourable than the direction ñ, 
orthogonal to the event plane, which naively is susceptible mostly to the combined 
colour charge of the system, namely 0. This depletion is relatively easy to understand 
with the help of the toy model. There, the “gluon” was made of two equal charges, 
two positrons, and the quark and anti-quark were identified with two electrons. In the 
symmetric Mercedes-star configuration the two equal charges compensate each other 
in the direction of ñ. A similar effect also is responsible in the case of QCD where 
the coloured gluon leaves the quark and the anti-quark with nearly opposite charges. 
Secondly, the radiation midway between the gluon and quark, in direction 7’ is well 
enhanced, due to constructive interference effects. This becomes most visible when 
comparing the soft radiation there with the soft radiation in the favoured direction ñ 
in the qq@y final state, the QED events. The ratio of the soft radiation in both regimes 
is about 11/8, an enhancement by about 30% in the QCD case. Finally, it is worth 
noting that in the QED case, soft radiation in the directions 7’ and ñ, are similarly 
disfavoured, by a factor of four with respect to the “best” direction. By far and large, 
these findings have been confirmed by the JADE collaboration in [203]. 


5.2 Analytic resummation techniques 
5.2.1 Basics of Qr-resummation in QCD: b,-space 


In this section, essential ingredients for QCD resummation are recapitulated and fur- 
ther generalized. This will extend the introductory discussion in Section 2.3.2, where 
the structure of a general expression resumming leading logarithms to all orders has 
been exemplified for the pı spectrum of a W boson in hadronic collisions. 


5.2.1.1 Logarithmic accuracy 


In the kinematical situation where the transverse momentum is much smaller than the 
invariant mass of the boson, Q1 x < Qx, the production cross-section has contribu- 
tions of the form 


ni m Qix 
a, log 


1 
or a where m < 2n- 1. (5.58) 
1,x 
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Although these terms seem to be highly singular for Q1,x — 0, when properly re- 
summed they reorganize into the finite Sudakov suppression factor already encountered 
in Section 2.3.2. The transverse momentum Q1,x of the boson is of course nothing 
but its recoil against (soft) gluons that have been emitted by the incident partons, 
lending it the name of “soft-gluon resummation” or “transverse-momentum 
resummation”. 

The formal accuracy of resummation methods is classified according to the order 
m of the logarithms above that are being taken into account in the resummation of all 
orders in the strong coupling, n. If all terms m = 2n — 1 are included, the resummation 
is said to be of leading logarithmic (LL) accuracy. Of course, integrating the LL 
expression over all transverse momenta above some minimal value Q1,x will result in 
terms of the form a? log™” (Q3 x/Q%). 

For higher logarithmic orders, there is some dispute in the literature concerning 
nomenclature. For the purpose of this book, however, a convention is chosen, where 
including terms with m > 2n — 2 refers to next-to-leading logarithmic (NLL) 
accuracy. Including one more logarithmic order, i.e., all terms with m > 2n — 3, will 
be dubbed NNLL-accurate and so on. 


5.2.1.2 Qr resummation in the Collins—-Soper—-Sterman approach 


In Section 2.3.2, the CSS expression for the resummed double-differential cross-section 
has been given as 


do 7 d?b 
SCAB XK _ 5o ee {| = [expt ĠW, ij(bL; Q, A, 2n)| 
ij 


dydQ?, (Qn)? es 


+ Vizsx(Qi; Q, z4, vp) ; 


cf. Eq. (2.166). The two Bjorken-parameters x4 and xg in this equation are entirely 
fixed by the invariant mass of the singlet system, Q?, and its rapidity y. As already 
discussed, the resummation part Wij and the hard remainder Y;; can be written in 
terms of various coefficient functions. However, before discussing their form, it is worth 
noting that it has become BR to evaluate the PDFs in both the resummation 
part W;; and the hard remainder Y;; at a common scale up. While at LO/LL accuracy 
this presents an irrelevant shift beyond the actual perturbative accuracy this particular 
choice has to be taken into account at higher orders. In the resummation part Wij 
the common scale up is corrected to the “natural” factorization scale 1/b,; this is 
achieved in the collinear terms Cia and Cj, which are a part of Wij, see Eq. (5.60). 

Therefore, employing this choice, and suppressing the scales as arguments of the 
two parts in the expression for the resummed cross section, 


Wij(b1; Q, £4, a 


as i =o jg Fe A haja (Ea È) fore (En, 1) 
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Q? 
dk? 2 
x exp |— / a (A) ow $ + B) (5.60) 
B z 
and 
1 ae 1 í 
¥ij(Q13 Q, tA, £B) = JZE fija (Ea, Ue) fia (EB, HF) 
LA LA 


(5.61) 


x Rij=>x Ce Q, A zz) , 


see Eqs. (2.167) and (2.170), respectively. The term 6? appearing in the lower limit of 
the &,-integration in the Sudakov form factor typically is chosen as 
bo = 2e7 7” (5.62) 


with the Euler—Mascheroni number yz (cf. Appendix A.1.1). 
As already discussed in Section 2.3.2, the coefficient functions in the master equa- 
tion above are expanded as perturbative series in as, and read 


ax (74. "20,.2) = aa ~) ( A z) 
Rijyox (2. EB’ Q, n) = PE ( 2r Ris 4x Q, Ea’ £B š (5.63) 


Typically the first terms of the functions A, B, and C depend on the incoming particles 
only but not on the specifics of the produced singlet system X, while the hard terms 
H as well as the functions R are process-dependent. 


Analytic resummation techniques 287 


It is interesting to note here that there is some freedom in how different contri- 
butions are accounted for. This is in particular true for the hard higher-order (i.e., 
loop) corrections encoded in Ha»p+x and the collinear coefficients Cia. In the original 
formulation by Collins, Soper, and Sterman[410], implicitly Ha»+x = 1 was chosen 
and the loop corrections were encoded within the Cia. In contrast, in more recent work 
by Catani, de Florian and Grazzini [342], the term H,»-,x was explicitly introduced 
and chosen different from unity. 


5.2.1.3 Process-independent terms 


The first terms in the expansion of A and B do not depend on the process in ques- 
tion but on the incoming partons only, see also Section 2.3.2. Focusing on gg and gg 
annihilation processes into singlets, this is reflected in the respective charge factors 


Cy = Cr and Cy = C4. (5.64) 


The higher terms A) are given by the coefficients of the soft part of the DGLAP 
splitting function Paa with a = q, g to the nth order in ag: the A“) is the coefficient 
of the soft terms in the leading-order splitting functions PR, cf. Eq. (2.33), i.e., 
by the term that comes with 1/(1 — z)+ with the numerator taken in the soft limit 
z— 1. The A?) and AC) merely modify these terms with universal factors encoding 
the soft one- and two-loop corrections to these kernels. Therefore [502, 689, 767, 883], 


AD = 2Cy,9 
67 r? 10 
A) = 2C4, g K = 2Cqg fes E = T) = EPan! 
AY = 2044 K' 
245 677? 1l u ry? 55 
= 2 i 99 
Gas fos A 96 + 5 (3) 4 5 (=) +Crny | = + 2c(3) 


209 107? 7 Psa 


Here, ¢(3) ~ 1.2021 is a special valiue of Riemann’s ¢-function (cf. Appendix A.1.4). 
At first sight it may seem a bit of a coincidence that, apart from a global prefactor 


2Cq,g, which depends on the particle in question, the higher-order terms in AGP are 
independent of the particle. This, however, is fairly straightforward to explain: the soft 
terms 1/(1—z)+ which give rise to A(k? ) in Eq. (5.60) stem from an eikonal expression 
for soft gluon emission. As already seen, such an eikonal, being quasi-classical does 
not show any dependence on the details of the particles emitting the gluon beyond 
charge factors — it is sufficient that there are sources for gluon emission, their exact 
characteristics such as spin are beyond the resolution power of the long wavelength 
QCD fields. 

The B terms essentially can be identified with the factors in front of the ô(1— z)— 


terms in the one-loop splitting functions PM) (z), cf. Eq. (2.33). They are also known 
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as the anomalous dimensions a) of the splitting functions, 


BY = -9% ., (5.66) 
Therefore [502], 
B® = -3Cr 
E A (Fe. 5Tans) (5.67) 


In contrast to the A”) terms this way to extract the coefficients through a simple 
procedure does not trivially extend to higher orders: in fact, for all n > 2 the B™ 
terms also contain parts that depend on the hard process in question.! 

Furthermore, the first terms in the expansion of the collinear subtraction factor 
are given by 


(5.68) 


The terms proportional to the first-order splitting kernels pO, cf. Eq. (2.33), account 
for the PDF evolution from ppr to the correct scale bo /b,, which would be the natural 
choice in the resummation (to which the Cia contribute). They thus reflect the scale 
choice already hinted at. Furthermore, the P$ (z) denote the O(e) terms of the splitting 
kernels at next-to-leading order and originate from the way collinear divergences are 


treated in the M S-scheme. They read 


= — Cp (1-2) 


) 
me (5.69) 
) 


II 


— 2Trz(1 — z) 


These terms as well as the ones proportional to 77/6 should not come as a surprise. 
They are nothing but the finite collinear terms that stem from the € expansion in the 
subtraction, where 1/e poles conspire with terms proportional to e in the splitting 
functions to yield finite contributions to be absorbed into the PDFs. These terms, the 
P*, are thus readily identified with, for example, the corresponding terms in fixed-order 
calculations. 


1The identifcation of the BO) terms with the anomalous dimensions is the reason why parton 
showers can be shown to provide an approximation to the p, spectrum of colour-singlet objects in 
hadron collisions, which is accurate up to the next-to-leading logarithmic level in the Sudakov terms. 
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5.2.1.4 Process-dependent terms: B®) 


The B®) terms have been worked out for example for the cases of Drell-Yan production 
in gg-annihilation in [436] and for Higgs boson production in gluon fusion in [443, 444]. 
Analysing its structure in more detail, the authors of [444] were able to relate it to 
the anomalous dimensions, ny?) in the two-loop splitting functions PË, i.e., their 
coefficients of the 6(1 — z)-terms, plus a term proportional to the one-loop correction 


to the leading-order amplitude, ACs 
(2) (2) 27° (loop) 
Bo’ = —27a" + bo | =z Ca + Aa ™ ) , (5.70) 
where 
Sedi em alaa a o a hs 2 
= — — — — _ — NE — — — 
Va Fig 2 FA |34 18 TRSA IG 9 
8 4 
P EnA $ + xo) — CrTrnf — gCarany - (5.71) 


This ultimately results in the expressions known from the literature,? namely 


3 11r? 193 
Beg = Cr G A 12608) + CrCa a= BEET gg 6c) 
17 4r? 
CrTrne | — — — 
FCOFLRNf E 9 
23 227? 2 8r?] 11 
Byyon = 0 É +2- 666) +4CpT ans — CaTrng f if z] s OR 


5.2.1.5 Process-dependent terms: the finite loop correction 


For the phenomenologically relevant cases of Drell-Yan, W-boson, and Higgs-boson 
production, the first non-trivial term in the expansion of Hab—x reads 
(1) (1—loop) 
Hox = Abox 3 (5.73) 


where the finite loop contributions A17 1°9P read [142, 438, 474] 


ab— 


—loo 27? 
gG) = Cp (-8+ ©) 


qq Z,qq'—W+ 3 
(5.74) 


gg H 


ee 2r? 
Ato) _ Cy (5+ 72) -ace. 


2When comparing with literature, care has to be taken, since at this order differences between the 
by now customary MS-scheme and the DIS-scheme start showing up, leading to some shifts that are 
cumbersome to trace. 
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Note that here the loop correction to Higgs boson production has been evaluated in 
the m — oo limit. 


5.2.1.6 Process-dependent terms: The hard remainders R,y_,x 


In order to calculate the hard remainders, it is important to understand the principle 
underlying their construction. This will be worked out with the example of the term 
Re w)» the hard remainder to O(as) for the production of a W boson in the quark- 
anti-quark annihilation channel. The starting point is provided by the real-emission 
cross-section for gq’ — Wg in Eq. (2.111), where they have already been provided in 


a form suitable for the further discussion here: 


do dx dx 
toa, I es B ftua (£a, HF) faye (EB, ur) S++ û- miy) 
(LO) 1 aCr È +â? + mis 


x o iow l oe on 


8 


(5.75) 


From this expression all terms at O(as) must be subtracted, which are already 
present in the resummation part Wij, but which do not originate from a genuine higher- 
order correction. At NLO, this does not include the generic higher-order corrections 
encoded in the terms C and H“). While the former originate from the treatment 
of collinear divergences at NLO, which are absorbed into the PDFs, the latter are 
genuine loop corrections. Therefore both can be ignored for the R“. This reasoning 
results in the following contributions, to be subtracted: 


e terms, where one of the incoming partons remains at its light-cone momentum 
fraction, z = 1, and where the PDF evolution of the other parton through the 
DGLAP splitting function and the corresponding shift in its momentum is taken 
into account. This will yield terms of the form 


—, 6(1— z4) P(zp). (5.76) 


e terms which have their origin in the expansion of the Sudakov form factor to 
the first order in as. With them multiplying a Born-like phase space, they are 
proportional to 6(1 — z4)d(1 — zg). Therefore, overall, they amount to a term of 
the form 

d(1 — z4)d(1 — zp) 
Qt 


For the example at hand, therefore 


xt B®), (5.77) 
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Q? 
—d(1— z4)ô(1 — zp) (2 log => — 3) 
Qi 


5(1 za) (G28) al m (1A) bx 


1 — zB 1 — zA 


Contact with the literature, and in particular with [410], where the R™ term has 
been worked out for the first time in the context of Qr resummation, is established 
by identifying 

P +a +2m2,8 = (Q? -AR - a)’, (5.79) 


using the Mandelstam identity 
my =Q =s+t+ia. (5.80) 


The hard remainders for the qq-initiated parton-level process are given by 


1 (1) 1 f CA++ pen 2 
Raw = Rigw z An { ee O(§+t+u Q’) 
2 i= 2 
(1 — za) A 2 a \ 
L 
eN (5.81) 
1 (1) 1 (E+) t tA n a 
Rw =aaw = ae { a 6(8+t+a—Q’) 
2% +(1— 2,4)? 
6(1 B) A = A) \ 
L 


To achieve the subtraction in a form which lends itself to the implementation in 
a computer program or similar, the kinematical quantities in the resummation part, 
and in particular the 4,g and the transverse momentum Q1, need to be mapped onto 
the Mandelstam variables used in the finite remainder. This is achieved through the 
relations 


Q3 5 
1 : Vito i 
3 Q, i pt erg o = m (5.82) 


eee a EBA 


> 
l 


In addition, there will of course be those terms that originate from real-emission 
processes initiated by other partons. In the example of W production, such terms 
are related to processes with a gluon in the inital state, like qg —> Wq’. Such terms 
do not need special care in subtracting out contributions already accounted for in the 
Sudakov form factor, since they are simply not present, and only those terms involving 
the PDFs must be subtracted. 

When going to higher orders n > 2, of course, the picture becomes a bit more 
involved. First of all, of course, the finite multiple-emission matrix elements assume 
an increasingly complicated structure, eventually mixing loop corrections with real- 
emission terms, which will make it harder to identify terms in a straightforward way. 
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This, however, is of course merely a nuisance, since they could in principle be taken 
from numerical routines. However, at the same time, in the expansion of the resum- 
mation part, the terms A("~*) and B(—*) from the Sudakov exponents will combine 
with terms of order k in the finite parts, the C and H, which will become process- 
dependent. 


5.2.1.7 Soft regime: b; — co 


Another problem, up to now more or less swept under the carpet, needs to be ad- 
dressed. It is related to the fact that the Fourier transform to impact parameter space 
necessitates an integral over all b,, from zero to infinity. While the short distance 
part, bi — 0 is no problem at all, the long-distance part with bı — oo is, and for two 
reasons. First of all, integrals with infinite integration limits are notoriously unpleas- 
ant to deal with and guaranteeing their convergence to the true value is sometimes 
not trivial, especially when aiming for high numerical precision. While this is a merely 
technical problem, the physical problem in this region is much harder to solve in a sat- 
isfying fashion. It is related to the fact that in all integrations, the PDFs are evaluated 
at scales up œ 1/b,, which therefore may be taken into unphysical infrared regions 
where Q? — 0. Similarly, in the Sudakov form factor, the strong coupling is evaluated 
at scales k? , but, again, the integration region of k? extends down to values of 1/b7 
thus also probing the strong coupling in the infrared regime and eventually hitting the 
Landau pole when b; — co. 

As already indicated in Section 2.3.2, the dangerous region of large bı can be dealt 
with by multiplying the resummation part Wi; with a non-perturbative modification 
factor. This is often supplemented by suppressing large values of b} by replacing it in 
Wi; with a modified b, defined through 


Wij (bi, -..) — WOP) (bL, ...) Wiz (Bay +); (5.83) 
where 7 
bs = = (5.84) 
1+ (b1 /bmax)” 


There are different parameterizations for this non-perturbative suppression factor be- 
yond the simple Gaussian of Eq. (2.3.2), namely 


p 2 
Woe (b7 ) = exp [Fi bx) og (5) — Fishy (x1, bi) — Ejla (xa, o| 


—g1b? — gob’ log 


z,(DWS) /,2 TX 
Wi (bi) = exp 


Wie OR) = exp 


Wwe (b3) = exp [me — gob% log (2) — gıg3b1 log(1007122)| 
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Table 5.1 Fit values for different parameterizations of the non-pertur- 
bative function WOP), taking into account a variety of TEVATRON Run 
I Z-boson data, from [716]. Note that more recent fits of the BLNY form 
tend to use bmax = 0.5 GeV", thereby extending the perturbative regime. 
These fits quite often lead to practically vanishing g3 % 0. 


| DWS [437] | LY [711] | BLNY [716] 
gi[GeV"] 0.016 0.02 0.21 
g2[GeV?] 0.54 0.55 0.68 

g3([GeV—*] -1.50 -0.60 


The parameters in some of the parameterizations above have been fitted in [716], 
resulting in global values of 


Qo = 1.6GeV and bmax = 0.5GeV!. (5.86) 


and in values for the parameters g1, g2, and g3 as listed in Table 5.1. The functions F; 
and F’/;, in the more general form of the non-perturbative function in wo should, 
in principle, be general and would have to be extracted from data. However, by far 
and large, this approach has not been followed. 


5.2.1.8 Practicalities of implementing the calculations 


One of the main problems with the evaluation of Eq. (5.59) is that regions of large 
impact parameter, b} — oo, contribute to the Q x spectrum at all values, because 
of the Fourier transform from Q, to b,-space. This has physical and technical im- 
plications, which need to be addressed. Starting with the physics problems, due to 
this effect non-perturbative effects have an impact to observables in the perturbative 
regime. In practical implementations, typically, this problem is being solved by either 
adding a non-perturbative form factor in the spirit of Eqs. (5.83) — (5.85), essentially a 
dampening at large bı, with parameters to be fixed through comparison with experi- 
mental data, or, by a cut-off b., cf. Eq. (5.84) or a combination of both. The down-side, 
however, of the non-perturbative dampening factor or the cut-off is that they might 
of course still introduce visible consequences at finite, and possibly large, Qi x, a 
somewhat counter-intuitive situation. Furthermore, the strong coupling is evaluated 
at scales proportional to 1/b1, i.e. at scales where the strong coupling diverges in the 
infrared region. To remedy this, some freezing or similar must be invoked, in order to 
guarantee that the ag, does not touch the Landau pole and diverges. In principle the 
same is also true for the PDFs, but there is no Landau pole lurking and the different 
scales could be simply connected through the DGLAP equation with the caveat that 
then, of course, problems with the scale in the strong coupling may raise their head 
again. 

Finally, as an added technical problem, the exponential term of course oscillates: 
after integration over the relative angle, exp(iQ 16 1) yields Jo(b1 Q1), a Bessel func- 
tion. This renders the matching to the more accurate fixed-order results at large values 
of Q, a somewhat subtle and potentially numerically unstable exercise. 
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One way to circumvent the problem with the large b] integration comes from the 
realization [712] that the exponential leading to the Bessel function can be rewritten 
as 


> CO 
J exes Ga) $1) = gy | OQd) Fb.) 


0 
1 foe) 
=> [eats [meb v) + Malaba, v) f(b.), (5.87) 
0 
where 
1 —nHiVn i e 
hi(z, v) Z of dO e778i29 and ho(z, v) een / dO ew iz8in 8 | (5.88) 
A T 
—ivn T+ivn 


These functions reduce to the usual Hankel functions H;,2(z) for z + oo. They are 
finite for all finite values of z and v and fulfil 


hi(z, v) +ha(z, v) = 2Jo(z), (5.89) 


independent of v. Ultimately these functions allow the evaluation of the original b, 
integral as a function of two contours in complex space, one for each of the h;. In so 
doing it is possible to deform the contours such that the Landau pole is avoided, a 
procedure that is equivalent to the original integral for all finite orders in perturba- 
tion theory. This is a solution that is being used in Qr resummation, along with the 
more simplistic cut. Astonishingly enough, both solutions regularly yield numerically 
equivalent results [707]. 


5.2.2 Resummation in b,-space for specific processes 


In the following some example results for the Qr-spectrum for various (singlet) final 
states are provided. 


5.2.2.1 Z production 


It is straightforward to arrive at expressions for all terms relevant for the production 
of a Z boson in gq annihilation and the respective higher-order corrections: The Born 
cross-section for W production given in Eq. (2.67) has to be replaced with the one for 
Z production. This can be achieved by adapting couplings, effectively replacing 


e? K — Ale;| sin? Oy)” + 1| Sij 


2_ eWay? waz 
| = — 3 > a) 
sin* Oy 4sin* Ow cos? Ow 


(5.90) 


giv Vij 


All other terms in the resummation part of course are not susceptible to the details 
of the gauge boson production, since they merely account for QCD effects in the 
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initial state. For the hard remainder, the same holds true, after the couplings in the 
real-emission matrix elements have been adapted as above. 


5.2.2.2 Higgs production through gluon fusion 


Py distribution of Higgs boson 
gluon fusion at the 14 TeV LHC 
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A NNLO+NNLL (matched) 


py [GeV] 


Fig. 5.5 pı spectrum of the Higgs boson at the 14 TeV LHC, evaluated in 
the mı — infty approximation with the HqT code [282, 442], and ignoring 
contributions from b-quarks. Black lines correspond to results contributing 
to or accurate at NLO+NLL accuracy, while the red lines show results or 
contributions to the NNLO+NNLL result. Dashed and dotted lines show 
the resummed and fixed-order parts only, while the fully matched results 
are shows as straight lines. For all results the CT10 PDF NLO set with 
Qs = 0.118 has been used, with wa HF LR = my/2 = 62.5 GeV. 
Non-perturbative effects are not included. 


For Higgs production through gluon fusion mediated by the effective vertex, the 
structure of the result at NLO-+NLL is identical to the one for the production of vector 
bosons in quark—anti-quark annihilation. The LO cross-section is given by 


(Lo) _ V2Gr as(mi,) 
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and, of course, it must be convoluted with suitable (gluon) PDFs. Similarly, in the 
Sudakov form factor, the terms A and B are the ones for gluon processes, taken from 
Eqs. (5.65) and (5.67), and the collinear terms again are those for gluons, this time 
taken from the corresponding item in Eq. (5.68). The hard loop correction is taken 
from Eq. (5.74). The same strategies already employed in the case of vector boson 
productions also will yield the results for the hard remainder terms R. 

In Fig. 5.5 results for the p; spectrum of a 125 GeV Higgs boson at the 14 TeV LHC 
are exhibited, at NLO+NLL and NNLO+NNLLE accuracy and in the large top-mass 
limit. The results have been obtained with HgT [282, 442]. The fixed-order correction 
in the matched result has two prominent effects: first, it changes the overall cross- 
section, and second, it fills the tail of the distribution at scales of the order of half the 
Higgs boson mass or above. 


5.2.3 Qr-resummation: calculating with Mellin transforms 


Up to now, potentially large logarithms of the type log(Q%/Q7. x) of the transverse 
momentum Qr,x of a heavy (colour-singlet) system X with mass Qx have been re- 
summed in impact-parameter or b,-space. This variable essentially emerges from a 
Fourier transformation, and is conjugate to the transverse momentum. It has been 
introduced to guarantee that the transverse momenta q, of the emitted soft gluons 
combine to yield the overall transverses momentum of X such that 


Qux = 5 Li- (5.92) 


As a by-product of this approach, the various pieces in the master equation Eq. (5.59), 
and in particular the resummed and finite remainder contributions Wij and Yi;.x 
emerge through a convolution of various contributions with the PDFs, where the nat- 
ural scale for the evaluation of the latter is given by 1/b1. This cross-talk induced by 
the convolution sometimes renders an analysis of the individual contributions a tricky 
task. The somewhat unsatisfying situation can be greatly alleviated by using Mellin 
transforms which allow one to rewrite convolutions as simple products. The price? to 
pay for this seemingly superior procedure is the necessity to transform the results, 
obtained in a simpler way in Mellin space, back into x-space, which often is highly 
non-trivial. 


5.2.3.1 Resummed cross-section in Mellin space 


The resummed part of the cross-section for singlet production in Eqs. (2.166) and 
(5.59) can be manipulated in such a way that the Bjorken parameters x4 and ap are 
not fixed anymore. To make contact with the literature, e.g., [436], this can be realized 
by integrating over y and considering only do/dQ?. Ignoring the hard remainder, 


3This is a perfect example for an important conservation law: conservation of pain. 
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1 


Qdos a x d?b 
ABX as Regs J dvaden la o(Q1b1) Wij(b1; Q, x4, £B). 
0 


dQ? dQ? 27)? 
(5.93) 
Introducing the rescaled invariant mass variable 
Q? 
= — 5.94 
per (5.94) 


with S the hadronic centre-of-mass energy squared, allows the definition of the Nth 
moment of the cross section above with respect to T: 


“Te gue le C2 a0 
Sas) = / arr" a) GORE (5.95) 
4 AB>X 


This constitutes a Mellin transform of the normalized differential cross section. Among 

other convenient factors, it has been multiplied by Q? to cancel the singular 1/Q? 
behaviour. The upper limit of the integral approximates the kinematic boundary for 
soft particle emissions. It could be set to one under the assertion that the integrand 
does not have any support for larger values of 7. As discussed in some detail in Ap- 
pendix A.1.4, the trick about using such Mellin moments is that the cross-section 
neatly factorizes into separate contributions from PDFs and partonic cross-sections 
without complicated convolutions of both: 


Dapox(N)= >> Livan ee) fijB(N, ur) dag(N)] (5.96) 
tj 
Here the partonic part Sy (N) collects all the terms in the Sudakov form factor and 


the collinear functions. After the integration over the angle between 5 ıı and Q j it is 
thus given by 


SuN) = QED | EE 4 Jol Qs) CialN, 04(08/8E)) Chal, 9(68/02)) 
0 


Q? 
dk? 2 
xop- S F (Aate + Ba) 
B T 
dk? l 


298 QCD to All Orders 


where Jo(x) is the Bessel function from the angular part of the integral over impact 
parameter space. 

At this point it is useful to compare the result with the original expression in 
Eq. (5.59). Apart from normalization factors, there are a few differences due to the 
Mellin transformation. First of all, the convolution of the collinear functions C with 
the PDFs is replaced by a simple product of their Mellin transforms, the C(N) and 
fi,j/A,B> and the summation over the Mellin moments. With the PDF's at up factored 
out, their DGLAP evolution to the scale 1/b, is captured by the integral over their 


anomalous dimensions, the y(V). In b space this was accounted for by the terms 
proportional to P® log(b? uz), cf. Eq. (5.68). In Mellin space these terms are not 
present any more. For the case of Drell-Yan-like processes they are therefore given by 


(remember i = a = q) 


Cia(N, t a4(t8/02)) = My [E (2 E ue) | 


z j dzz" 90 jyaw ( Pie) + diaô (1 0a) +003] 
0 


: ! =) +0 (a2). (5.98) 


izazq , , Cras() 
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Note that in the result above the constant terms are at variance with the result quoted 
for instance in [436]. This is because there the hard loop contribution has been absorbed 
into the collinear functions C. The Mellin transform of this additional part here is 
trivial, since the hard loop contribution is proportional to a -function in x-space, 
which yields unity. 

Due to the absence of the analytically hard-to-control convolutions in «-space and 
their replacement by mere products due to the Mellin transform, it is now possible 
to try to identify logarithmic terms in such a way that the integral over the residual 
impact parameter b} can be handled with an increased level of analytic control. An 
important step into that direction is to replace the looming logarithms of b, in the 
Sudakov form factor, which will need to be integrated over, by logarithms of Q1 
instead. 

To achieve this, the integrand is expanded in a power series of as, while collecting 
all logarithms of the form log(Q?b? /b8): 


(5.99) 


The term L = log(Q?/3,) stems from the second term in the exponential, by moving 
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the upper limit from y% to Q? to combine it with the original Sudakov form factor 
terms. Typically, however, this is not a large logarithm in singlet production. The other 
logarithms, log(Q?b7 /b2), in contrast are potentially large, since, after integration over 
b1 they will result in logarithms of the form log(Q?/Q? ). They stem from the Sudakov 
form factor and the terms « Yq;(V) contribute some of the sub-leading logarithms. 

Up to second order in as, the nDm, with dependence on the parton flavours a and 
b implicitly understood, are given by 


1 
1D2(N,L) = — 5 4” 
1Dı(N, L) = — B® — 2y(N) 


1Do(N, L) = 2y)(N)L +20 (N) 


1 (5.100) 
2D3(N,L) = — 30 

1 4) lawr pw _ yo 
2D2(N,L) = — 5 AN’ + Bo | a A E- 5 BAEN) 
2Di(N,L) = — B®) — 27) (N) + Bo [ame + 20N) +200 (0)| 


In order to extract logarithms of log(Q?/Q?) instead of those of the modified 
arguments, the authors of [436] formulated a similar expansion of X;;(N), namely 


co ntl 


buj(N) = L pe 5 Cri, Li 0,0) (e) (io AT . (5.101) 


n=1m=0 


The coefficients pm have been obtained by a direct fixed-order calculation, which 
explains the lacking exponential. They further differ from the „Dm due to different 
logarithms. The nm were matched in [436] to the nDm by expansion of the exponen- 
tial in Eq. (5.99). 

To push this line of reasoning further, the authors of [505]* have analysed the resid- 
ual difference between the two expansions. To this end, they considered the integration 
over b; which becomes feasible by using the relation 


d[xJi(x)| 


az = rJo(x) (5.102) 


between the Bessel functions and through integration by parts. The boundary terms 
for large impact parameters vanish due to the exponential dampening encoded in the 
Sudakov form factor, and therefore, after substituting x = bi Qı 


4In fact, they were hoping to be able to construct a resummation procedure directly in Q space 
rather than b] space. This was driven by the observation that the latter method invokes an integration 
over all values of b; inducing long-distance physics even for relatively large values of Q1. 


300 QCD to All Orders 


(N) = oS l dzJı(x 
a SEs D (SPY (ode) I) 
(5.103) 


If log(x/bo) = 0, or, alternatively, setting Q1 = b1/bo in Eq. (5.99), the original 
structure of the logarithms encoded as log(Q?/Q% ) becomes visible. It is possible to 
split off the logarithms log(x/bo) and careful analysis shows that they only contribute 
at the N°LL level. This can be seen by expanding the exponential in powers of this 
logarithm, which combined with the Bessel function yields 


1 form=0 


i m 0 for m= 1, 2 
J dzJı(x) log™ (a/bo) = ~1¢(3) for m =3 (5.104) 
form > 3 


under integration. With this in mind — and ignoring the residual sub-leading terms 
in the exponential — the integral over x reduces to unity and the cross-section can 
be written purely in Q1 space. This basically amounts to the replacement of the 
b, -integrated Sudakov form factor 


Co 


dk? 5 77,(non—per 
i; db, by Jo(b1Q1) exp / E (a -B) Wee eet (5105) 
0 yee a 
with the Q -space expression 
2 
d dk? /,, Q? = 
Alog ~+B)| . 5.106 
age | | et (Peat oo 
2 
pe 


Here, the Q-space parameters A and B coincide up to NLL accuracy with their b] - 
space counterparts and the relative differences are exactly calculable. However, despite 
its elegance this approach ultimately could not be extended to accuracy higher than 
NLL and it therefore was not further pursued. 


5.2.4 Threshold resummation 


Soft gluon emissions off the initial state in production processes at hadron colliders do 
not only contribute to the transverse momentum distribution of the produced systems; 
they also induce large logarithmic corrections to the inclusive production cross-section. 
In particular, this is the case when a heavy system of mass Q is being produced. Then 
the emissions of gluons off the incoming partons are connected to the splitting function, 
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which encodes a logarithmic enhancement in its soft part « 1/(1—z),, when z > 1. 
The terms giving rise to these logarithms are typically of the form 


n [eea] (5.107) 


s 1—z 


Kinematically, this limit is approached when the combined centre-of-mass energy of 
the incident partons approaches the mass threshold of the system, Vê ~ Q, squeez- 
ing the phase space available for gluon emission. In this limit, therefore, the gluon 
emissions will not visibly change the kinematics of the produced system, their contri- 
bution however will change the production cross-section. This threshold effect on the 
cross-section was first found in [168, 354, 355, 863]. 


5.2.4.1 The factorized cross-section, once more 


Consider the differential cross-section for the production of a system X 


Q? 
a (5.108) 


1 T 

do 

Se = faf dada; fija (ti, ur) fj/B(j, Ur) 4 (-- 
0 0 


i o (T, Q, HR, Lr) , 


where Q is the mass of the system and vS the hadronic centre-of-mass energy. As 


before, the renormalization and factorization scales are given by upr and pr, and 

(thres) 
Wij 3 
perturbation theory. In contrast to the function W;; from Qr-resummation, no large 
logarithms explicitly show up in ao Instead, they emerge through convolution 


with the PDFs. Their origin are terms inside We which become singular in the 


(T, Q, HR, Ur) encodes the hard partonic cross-section, which is calculable in 


limit rT > 1 or z — 1 and diverge like a” log?"~'(1— z)/(1 — z). Sub-dominant terms 
come with smaller exponents for the logarithm. Contributions from initial-state gluons 
splitting into quarks or initial quarks radiating a quark play no role here, since only 
soft gluon emissions feature the 1/(1 — z) term in the splitting function. 

After a Mellin transformation, the relevant term reads 


1 
My |[W§"™ (7, Q, ur, ur)| = Mw [Wij] = / drr” wi") (r, Q, ur, MF), 
0 


(5.109) 
with wr = Q as the usual choice. In the limit of soft gluon emissions, 
2 
TF Q z1 (5.110) 
SLiTj 


and therefore the behaviour of My [W;,] is defined by its limit for large N. 
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To make physical sense of it and to then evaluate this contribution, it is important 
to remember that soft gluon emissions can easily be treated using the eikonal approx- 
imation introduced in Eq. (2.17). This approximation is of first order in as, but of 
course the soft gluon emissions captured by the eikonal exponentiate. This is because 
potentially dangerous terms stemming from gluon correlations and from gluons radi- 
ating secondary gluons which may generate further large logarithms cancel due to the 
inclusive nature of the process [339, 863]. In any case, a simple argument can be made 
that typically non-exponentiating contributions are colour suppressed for sufficiently 
inclusive observables. This means that by evaluating the eikonal contribution to first 
order, as below, the leading logarithmic behaviour can be deduced. 


5.2.4.2 The eikonal approximation and the emission phase space 


The eikonal cross-section for the emission of a soft gluon with momentum k off incom- 
ing quark lines with momenta pa and py is given by 


dk p ey? 
dw(k) = -(4m)as Cr Sa (2 2) , (5.111) 


cf. Eq. (2.14), where e€ is the energy of the emitted gluon. Soft-gluon unitarity also 
fixes the virtual contribution w? through 


w+ f aw(h) = 0. (5.112) 


The overall contribution to first order in a, from the eikonals therefore reduces to a 
correction factor Wiz to the cross-section given by 


Weik = (1 +w°) 6(1—7) + [aways (1-7 =) (5.113) 
where F is the energy of the incident quark. Taking the moments of this term yields 


the Mellin transform of WP") 


ig to first order in ag, 
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1 qd Q 
res] 4C N_4 ak? 
My [wi 4 eee [ae T I welk). (5.114) 


ay k? 
0 (1—z)Q? 


Here 1 — z = €/E has been identified and, as before, kų denotes the transverse mo- 
mentum. It is related to the momentum transfer along the incoming hard propagator, 
which for emissions off parton a is given by 


ki 


2 2 
= a— k| & 
q |(p )| I? 


(5.115) 


The 1 of the numerator (z% — 1) in Eq. (5.114) accounts for the virtual contribution. 
As the argument of the running coupling the transverse momentum of the emitted 
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gluon is being used, i.e. (1 — z)q?. This choice is similar to the ones made before, 
resumming leading IR singularities. 

The phase space available for the emissions is defined by the maximal interval of 
virtualities allowed for the intermediate propagator — the emission of the gluon with 
energy fraction (1 — z) induces a maximal virtual mass of (1 — z)Q? in the propaga- 
tor. The lower limit is related to the regularization of the collinear singularities. For 
inclusive processes, where no additional conditions are imposed on, e.g., the rapidity 
of the final state, the largest scale available in the process, namely Q?, serves as a 
meaningful cut-off. 


5.2.4.3 Leading logarithms from the eikonal 


To obtain the leading logarithmic contribution only, a, can be taken as scale-independent, 
reducing the otherwise tricky integral in Eq. (5.114) to 


dis. we Hug 2 ae 
(thres) |`? _ F Z = ae 
Ma [Wa = ae fe Eo 
0 —2)Q? i 
5 a log(1 — 
_ = fez + logi — 2) a see a ja ae) 2| 
l-z a 
4Cra 2 
= = a lose + ¢(2) + lo) +a ) : (5.116) 


In the limit of large N, therefore, 


12.22 [vonon pona] 


Cras [1 
ane E + (0g N + e)! . 
T 6 


Mix a Ne 


19 


(5.117) 


Alternatively, directly taking the large-N limit of the integrand in Eq. (5.114) amounts 
to replacing 


1 
“18 e(1-5-2), (5.118) 
thereby constraining the phase space for the energy integral. This results in 


1 


(1, alt) is 
My wae N srs fo TOLLER] (S) (1 x :) 


l-z 
0 


x 


O 2e Ja log(1 — z) AC ras 
0 


= log? N 
l-z 4T 96 f 
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the leading logarithmic term. These terms can be exponentiated, to yield the resummed 
result at leading logarithmic accuracy, 


res (LE) C S A 
My [wi 4 = exp { Ce EF + (og. +12)" l (5.120) 


tj 


5.2.4.4 Sub-leading logarithms 


For sub-leading contributions, the running of as has to be taken into account and the 
global soft enhancement terms proportional to the soft pole in the splitting function 
must be included as well. In addition, at higher order, i.e., starting at O (a2) new 
structures are emerging which reflect the fact that the emitted gluons skew the original 
colour flow by adding new directions for eikonals. Such contributions are encoded in 
terms D(k?). For cases, where there are also coloured particles in the final state, of 
course, also B(k,) terms would appear. They are absent for initial-state emissions 
only, since they relate to the non-soft remainders of splitting functions, once the soft, 
eikonal-type terms are subtracted. As such they have of course no support in the 
drastically reduced initial-state phase space which actually gives rise to the threshold 
logarithms in the first place. In other words, the phase-space constraint effectively 
squeezes them out in the initial state. Ignoring the case of coloured final-state particles 
and concentrating on colour-singlet production, therefore 


ie ag he alae 
res zZ = 
Mns [WE] =e} [a S F ADER] p, 
0 (1-z)Q? E 


(5.121) 
similar to the case of Qr resummation, but with the B terms replaced by the D terms. 
To further stress the analogy, also for threshold resummation the functions A and 

D can be expanded in powers of the strong coupling as 


A(u?) = S A™ (sy and D(u’) = 2 D™ (2 (5.122) 
with 


AY =2Cp, AP = 20FK, and DY =0. (5.123) 


Not surprisingly, the global term K reads, as in the case of Qr resummation, 
67 r? 10 
K = Ca (=-=) -Trnj 124 
A (3 6 ) g (RMF: (5 ) 
cf. Eq. (5.65). 


From these considerations for Drell-Yan type, i.e. quark-induced processes, similar 
expressions for gluon-initiated processes can be deduced by replacing Cr with C4. 
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To obtain actual physical results in z-space from the Mellin transforms the back- 
transformation must be invoked, translating the expressions in terms of Mellin mo- 
ments N back to centre-of-mass energies. 


5.2.4.5 Comparing Qr and threshold resummation 


At first sight, the Sudakov form factor in Q7-resummation and the threshold radiation 
function are remarkably similar. They have a nearly identical structure with the same 
coefficients A. There are some differences, though, which reflect the different physical 
origin of the large logarithmic corrections. 

First of all, there is an explicit minus sign in front of the momentum-space integral 
in the Sudakov form factor of Eq. (5.60), tracing its origin to the suppression of 
hard radiation, while in threshold resummation the gluon contributions enhance the 
cross-section, as reflected by the positive exponent in the Sudakov form factor-like 
exponential. Second, sub-leading corrections in threshold resummation are related to 
either particles in the final state — a case not considered here in either of the schemes 
— or to the emergence of new directions after the emission of at least one parton. In 
Qr resummation, in contrast, sub-leading corrections emerge already at leading order 
in a, in the exponential and originate from non-soft collinear emissions. As stressed 
above, these have no phase space open in threshold resummation and are therefore 
absent there. As a consequence, even the first sub-leading corrections in threshold 
resummation are process-dependent while they are not in Qr resummation. 

Ultimately Q-resummation is being used to evaluate a kinematic distribution — 
the transverse momentum of a final state — without changing the fixed-order cross- 
section, while threshold resummation usually keeps the kinematic unchanged and alters 
the total partonic cross section. This latter feature actually allows to approximate 
higher-order corrections to the cross-section for the production of a given final state at 
hadron colliders that have hitherto not been evaluated, see for example [195]. These 
approximations do miss finite terms and rely on the assumption that the corrections 
in questions are dominated by the logarithmic structures in the threshold regime. 


5.2.5 The BFKL equation 
5.2.5.1 The high-energy limit: Large logarithms in 1/z 


Up to now, the resummation of logarithms has been discussed, where emitted gluons 
are collinear and/or soft and thereby alter the kinematics of a produced system or 
change the cross-section. The corresponding logarithms are of the form log (pz. x/ Mz) 
in the case of Qr resummation, or log(M%/8) in the case of threshold resumma- 
tion, with Mx the invariant mass of the produced system X and pr,x its transverse 
momentum. In this section another class of logarithms will be considered, which ap- 
pear more independent of the produced system and become important in a different 
kinematic situation. These logarithms originate from multiple soft emissions along 
t-channel propagators; they are of the form log(t/8). They become large when the 
invariant mass squared § of the overall scatter system increases for fixed momentum 
transfers f — a limit also known as the high-energy limit. 
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In contrast to the DGLAP evolution, which encodes the evolution of (collinear) 
logarithms in transverse momentum through a strict ordering of the emissions in trans- 
verse momentum, the high-energy logarithms emerge for emissions that have all about 
the same transverse momentum, but are ordered in rapidities. As a consequence, ap- 
plying this to PDFs for partons i in a hadron h, fi/n(x, Q?), the following picture 
emerges. The DGLAP equation accounts for the evolution of PDFs in the (trans- 
verse) momentum scale Q while the evolution equation related to the high-energy 
limit accounts for their evolution in 1/x for x + 0. This new evolution equation, 
the Balitsky—Fadin—Kuraev—Lipatov (BFKL) equation [189, 517, 708, 709], is 
therefore related to the rise of the gluon PDF for small x which ultimately drives the 
total cross-section and the structure function F> in this regime, see Chapter 6. In the 
following, the derivation of this equation will be briefly sketched, and the kinematical 
situation, where it is relevant, will be discussed, borrowing to a large extent from the 
very instructive review [451]. 


5.2.5.2 Connection of amplitude and cross-section: Optical theorem 


In preparation for the derivation of the BFKL equation, some manipulations are nec- 
essary, which will only be sketched here. They are based on various properties of the 
S-matrix such as analyticity or more generic properties such as the optical theorem. 
It relates the total cross-section to the elastic or forward scattering amplitude. 

The starting point is the S matrix, which relates initial and final states of a reaction: 


lf) = Sle). (5.125) 


This operator decomposes into a non-interaction part, effectively a ôf; = 1, and an 
actual scattering part, ; 
S = 1 4717. (5.126) 


Demanding unitarity of the S matrix, i.e., probability conservation of the theory, 
results in a oe a fan 
1 Ł ôtô = 1+i(f- ft) 40 (5.127) 
or, schematically, 
if = (t- 7") = 3n(P). (5.128) 


Using momentum conservation on the T matrix, and decomposing into individual 
entries, 


GÊ = Tye = 2r (SO vt - oe) Tr (5.129) 


implies 
(TT) = D [eE (Teh Let) Tata] 100 


summing over all possible states |n). Equating initial and final state — tacitly assuming 
that in fact the scattering processes discussed here have two initial-state particles, the 
two initial hadrons — and making contact with observables yields 
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tot = - EE en" yee" (Sot - Yr") ca Ta x 5 IT) (5.131) 


n 


the imaginary part of the forward elastic amplitude. It has to be forward and elastic 
since the identity |f) = |i) also implies an identity of momenta. This relation is at the 
heart of the Cutkosky rules. 
Denoting elastic 2 — 2 scattering amplitudes, e.g. qq > qq, aq’ > aq’, Or 9g > 99, 
with A(8, Ê), this means that 
T = ee) -^ Ceu (5.132) 


Below thresholds, e.g. from bound states, there are no imaginary contributions, im- 
plying that there is a region along the ŝ axis where the amplitude is purely real. In 
this region the Schwartz reflection principle can be employed, which asserts that 


A* (3, Â = A(s*, Ê (5.133) 


connecting the imaginary part of the amplitude with its s-channel discontinuity 


malas lim A(S + ie, t) z — ie, t) 


= Disc[ A(s, Ê)]. (5.134) 


The analyticity of the S matrix allows to rewrite the elastic amplitude through 
dispersion relations involving the discontinuity as 


A(s, 8) = if ds’ Disc[A(s’, Ò] ee Disc[A(s’, Ð (5.135) 


A pa j ° A 
201 s'—§ 201 s' —§ 
—oo -_t 


This dispersion relation can also be rewritten as an integral in the plane of the (po- 
tentially complex) cosine of the scattering angle, 


2 
z =— cosh, = — (1 i =). (5.136) 


namely, 


A(é, 8) = / dz; Disc A(z, Ò] ee Disc[ A(z}, Ð (5.137) 


2ri zaz l dri he — 
—oo 1 

This form documents the impact of the unphysical region of the scattering on the 

amplitude, which is entirely a consequence of the analytic properties of the S matrix: 

the integral is over the unphysical region of the scattering, i.e. outside the interval 

+ € [-1, 1]. This consequently implies that all discontinuities or singularities in the 

scattering amplitude, those that impact on the cross-section, are in the unphysical 
regime of the scattering. 
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Assume that the amplitude can be decomposed according to a partial wave expan- 
sion, 
AlS, È = S021 + DACS, ÊP (2), (5.138) 
l 
with the P, the usual Legendre polynomials. Using Eq. (5.137) and the Legendre 
function 


1 
1 dz’ 
Qe) = 5 f Ae (5.139) 
allows to rewrite the lt? partial wave as 
a 14+L T de! 1 ; 1 F 
Ai(8, È) = [1 + (-1)'*"] zg) Dis AC’, D). (5.140) 


1 


Here L is the related to the overall parity of the amplitude. In Section 7.1 the 
Sommerfeld—Watson transformation will be introduced in more detail. Here it 
suffices to state that it essentially replaces the sum over discrete values | of angu- 
lar momentum parameter with an integral, thereby rendering l a continuous complex 
parameter. After the corresponding transformations the equations above, in the high- 
energy limit 


28 
> z — o, (5.141) 
ultimately become 
Ea oo de! P j 
AG, À = f deat 41) [1+ (=1)+] / z P= )Qul2e) picctaco!, D] 
2i 2ri  sin(rl) 
ô— ioo 
d+ioo 
1 Ree era 
—— dj ———____——. e F(t). 142 
dg sin(l) esa) pa) 
6-100 


” Vr r(i+1) 
Q(z) —> U E (5.143) 


have been used. The relevant quantity thus is the Laplace transform 
Fi(t) = / dye~"Y Disc[A(zi, Ô] (5.144) 
0 
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of the discontinuity, with the Laplace parameter y = log(z;/2). This quantity will be 
evaluated later on. 


5.2.5.3 Kinematics of elastic parton—parton scattering 


To start the discussion of how the large logarithms log(t/s) emerge in perturbative 
QCD it is useful to reformulate the kinematics of elastic parton—parton scattering such 
that relevant limits can be inspected in a transparent way. A very handsome tool for 
this purpose is the Sudakov or light-cone decomposition, in which momenta are 
written as 

p" = aP + BPË + ph (5.145) 


with two light-cone axes P} and the assumption that the transverse plane indicated 
with the subscript L is orthogonal to both these axes. For the discussion of cross- 
sections at collider experiments, it is sensible to choose these two axes as the beam 
axes P4 and Pp. In the case of parton elastic scattering, e.g. qq’ > qq’, the incoming 
momenta are then given by 


Pa = V8 (xa, 0; 0) and p = V/s Q TB; 0) ; (5.146) 


making it explicit that pa || Pa and py || Pg, without any transverse momentum. The 
outgoing parton momenta are expressed by their two-dimensional transverse momenta 
k1, and rapidities y; as 


= (koi + ks. ko,i — Raa: k.,) mo (kiue, ki jen; kai) , (5.147) 


where k1, is the absolute value of k l,i- In these coordinates, the metric tensor is 
decomposed into longitudinal and transverse components according to 


[Lal V pë 
Py + PaP, 
g” = a ap 


(5.148) 


where 6/” acts as a Kronecker-d in the x — y plane. 

Expressed by transverse momenta and rapidities, the phase-space element reads 
dk = d?k, dy 

(Qr)2(2E) (2r)? 4r ` 


(5.149) 


In2—-2 scattering with Pa + Po — po + pı, Momentum conservation in the transverse 


plane requires ki. o= = =k, 1 and therefore lku, o| = lku, 1| = kı. Introducing 
S + 1 
g = A = 5 log(z4/2B) (5.150) 


as the rapidity of the overall system and the rapidity distance of the outgoing partons 
from the centre-of-mass rapidity 
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y* = “> (5.151) 
allows the rewriting 
k Dki = 
LA = RT (e+e) = i e cosh y* 
k 2k 7 
tB = J (e= fem) = “= eT? coshy*. (5.152) 


Vs 


The Mandelstam parameters therefore read 
ê = 4k? cosh? y*, Ê = —2k? coshy*e-Y , and û = —2k? coshy*e” . (5.153) 


Using these expressions on the squared leading-order matrix element yields 


1 2_ OREU _ Ch Acosh?y* +e" Ch (yw)? 
(5.154) 
Therefore, 
2 
egg’ —raa' = (Anas)? |Maq’ad'| = TORAS ev. (5.155) 
di 1678? 64ki 


The factorization formula, Eq. (2.52), relates the matrix element to the corre- 
sponding fully differential cross-section, in the case of elastic quark—quark scattering: 


1 1 
g= i da / dig faja(@as Up) fq/B (eB, HE) Ogg saa (UP; HR) - (5.156) 
CA CB 


As usual, at leading order the parton distribution function fy/4(aa, uz) param- 
eterizes the probability to find a parton of type a in the beam particle A, with a 
line-cone momentum fraction x4, and at the factorization scale up. In order to 
actually detect the two outgoing partons, they must carry some minimal energy and 
momentum, thus implying a minimal momentum fraction ¢ w.r.t. each of the beams. 
Further emissions of course would make this picture more complicated and the trivial 
leading order partonic cross-section here would start to develop a more complicated 
kinematic dependence. For example, the integration over the x would allow further 
real emissions to be included in the partonic cross-section 6, which would need more 
energy and momentum than the ¢ which define the kinematics of the two outgoing 
quarks, and a dependence on the ratios ¢4,8/2,4,B would emerge. However, using the 
relations above, Eq. (5.156) can be cast into 


do qq’ aq! 
dk? dyo dyı 


= tafasa(©a, Up) te fo/p(@B, Up) 


Taz 


= tafasa(ta, Mb) eB fo/e (eB, Up) 36h cae (5.157) 
L 
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This becomes large for large rapidity distances y*. In this limit, § ~ —û >> —t#, and 
in fact, 


i N 
y > 5log (-5) ; (5.158) 


indicating that the large rapidity distance limit is identical to the high-energy limit. 
Replacing & with —8 and retaining only the leading terms of order 8?/t?, stemming 
from gluon exchange in the t-channel, yields the following squared matrix elements in 
the high-energy limit: 


C2 4 g2 
2 = ae = 2 _ VFJ 
IMagraq'l = Magsagl = Mazal = “9 p 
2 CrC, 4 8 
IM gag! = 5) Is Fp 
C2 4 62 
[Mogo = oS (5.159) 


Forming ratios shows that replacing a quark by a gluon amounts to multiplication 
with a factor C4/Cr = 9/4, as expected. 


5.2.5.4 Rewriting in the high-energy limit 


To arrive at the full logarithmic structure, further emissions need to be considered. In a 
first step the qq’ —> qq’ scattering amplitude will be rewritten in a fashion better suited 
for the following discussion. The full tree-level amplitude for q(pa)q' (pè) > q(ko)q' (kı) 
reads 

igh” 


Magad = (ho) (—ig.$) py (06) î [ast ioti ao) , (5.160) 


which is not very helpful for the further discussion. The reason for this is that the 
form of the metric tensor in the gluon propagator suggests that the calculation is 
being performed in Lorenz-gauge — instead it will prove advantageous to use light- 
cone coordinates and a physical axial gauge. 

Using the decomposition Eq. (5.148) for the metric tensor allows the identification 
of the leading contributions to the interaction. This is given by a structure where the 
helicity-conserving part of the quark-gluon coupling will be contracted with one of the 
light-cone-like polarizations of the t-channel gluon, while the other light-cone polariza- 
tion vanishes due to the equation of motion p,ti(pa) = 0. In addition, the transverse 
polarizations will induce a kinematically suppressed helicity flip, usually identified 
with a “magnetic” interaction.” Ignoring these sub-leading or vanishing terms, the 
amplitude thus becomes 


5 An alternative way to see this goes as follows. In the soft limit, where the momentum transfer 
q along the gluon propagator is small, terms such as U(pa + q)ypu(pa) can be approximated by the 
helicity conserving eikonal 2py,a, 


U(pa + Q)Ypu(pa) > 2Pp,a 


thereby eliminating the terms proportional to ph and 6/". 


312 QCD to All Orders 
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Po, b, Hb kı, dy, 41 


ors, 


Fig. 5.6 Feynman diagrams for gg — gg scattering at leading order. In 
the upper left the t-channel exchange diagram, relevant in the high-energy 
limit, is depicted including all momentum, colour, and Lorentz-index as- 
signments. The u-channel is shown in the upper right diagram, while the 
s-channel and four-gluon vertex diagrams are in the lower row. 


4 Loy f 
PiP, pa [aste iTe wa) 


(5.161) 
Squaring the amplitude and summing (averaging) over final (initial) state colours and 
spins yields 


Maq'aa = |tilko)(—tgsT iG) Yt; (r)| 


2 4g4(Tr[T*T*])” 16 [2(kops)(Pape)] [2(k1Pa) (Papo) 


|[Maa'—+aq’ = 4.9 §2f2 
4 T gab 2 AG 8 4 2 8 4 32 

eS a a (5.162) 
9 a 9 £ 9 # 


reproducing as anticipated the matrix element in the high-energy limit. 
A similar treatment can be applied to gluon-gluon scattering, which is dominated 
in the high-energy limit by the t-channel diagram shown in the upper left of Fig. 5.6, 


Mgg-+99 = iggf ua (Pa + Role + Guo —ho +0), + Iéna (—4 — Peso 
“igus = clon 9) ia + Ieu (q + i) ie + Inim (ki — m) 


w EKE” (Da)ey?” (po)eho (koe? (k1) 
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ae . adoc bdic 2S pF Ha* „Hb* Mo H1 
S i | 2gs f° Gu oPé,a | | 29s fS Jur we PC. gf | Oa s hos 
28 
. doc pbd a* Hy 
> —ig; f° SJ Gra to Ip m 7 i ee ey enon * (5.163) 


Here the first two lines encode the triple-gluon vertices, q = pa — ko with Ê = q? 


denoting the momentum transfer along the t-channel propagator. In the last line before 
the polarization vectors the metric tensor is already rewritten in the axial gauge, 
omitting sub-leading or vanishing contributions following the same logic as above. To 
be more specific, in the limit of small momentum transfer pa % kg and terms such as 
ko + €(pa) Or Pa + €(ko) therefore are small and can be neglected. This reasoning, again, 
reduces the triple-gluon vertices to helicity-conserving eikonal terms to be convoluted 
with a single light-cone polarization of the t-channel gluon. These manipulations of 
the elastic gluon-gluon amplitude, namely the restriction on t-channel exchange only, 
supplemented with rewriting the polarization vectors and the metric tensor, violate 
gauge invariance. This is easy to see by replacing any of the polarization vectors, 
for instance e(pa), with the corresponding momentum (pa in the example) and by 
realising that this will not make the amplitude in Eq. (5.163) vanish. In order to 
restore full gauge invariance, all contributions to the amplitude and all diagrams, i.e., 
s- and u-channel exchange and the four—gluon vertex, must be included. The complete 
gauge-invariant set is shown in Fig. 5.6. 

However, when consistently working in the high-energy limit and in a physical 
gauge, the amplitude above is sufficient. With the squared colour factor given by 


fotoe poituc fdoac' parker = OF (N2 _ 1) (5.165) 


6In order to understand this in more detail, the sum over the physical helicities À of the external 
gluons is cast into 


+ $ + 
1 np + nt pl n2 pH pt 
XO elp)" (p) = — | ot” i l 
retest (n-p) (n- p)? 


where n denotes an arbitrary four-vector, acting as the gauge vector in axial gauge. This vector can be 
chosen in a convenient way. For example, by setting n = pp for the incoming gluon a with momentum 
Pa the sum above becomes 


uB wow 
! 1 PpPa +P, P ! 
Sok, Paley! (pa) = — | gt -224m H] = oe, 


Xa 3 


effectively the ô over the transverse polarizations from above. Contracting the polarization sums of the 
incoming gluon pa and the outgoing gluon kg with a metric tensor yields the two physical polarizations 
plus terms that vanish in the high-energy limit £/3 — 0, thus explicitly encoding helicity conservation 
in this limit: 

f 


x x A t 
Guano Iu! uh | >_ 2 (Paley, (pa) | |X Ke (koe sh? (ko) | = 2fi+o(5)| (5.164) 


AG Xo 


This means that in the high-energy limit the Lorentz structures of the triple gluon vertices can be 
simplified such that only the terms with the metric tensor between incoming and outgoing gluons 
remain. 
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and the relation in Eq. (5.164), the summed and averaged amplitude squared reads 


_ 2 40% g* 5 99% 8? 
IMsoaq| ae a ee Te (5.166) 


as expected. 
Turning this into a partonic cross-section necessitates folding the matrix elements 
squared with suitable phase-space elements; for a two-body final state 


do dyodkî o dyıdkî ı 
2 An(2r)? 4r(27)? 
1 cosh(yo — y1) +1 dkĉ o dk ı 


= 252 (7 7 x 
= 28 sinh(yo— yı) (27)? (277)? (27)"6 (Eo +E) x 


; (27)*64 (Pa + Po — ko — kı) 


1 dk? 
28 (2r)? ’ 
(5.167) 


where the large-energy limit has been used, i.e., a strong ordering of rapidities yo > y1, 
leading to the ratio involving the hyperbolic functions to approach unity. In addition, 
the conservation of transverse momentum leads to the ki,o = k1, = k1. Conse- 
quently, the differential partonic cross-section reads 


(5.168) 


5.2.5.5 Real corrections: Multi-Regge kinematics 


In a first step to an all-orders resummation, consider the kinematics of a 2 > n+2 gluon 
scattering, where the outgoing momenta are, again, labelled by k; with i € [0,n + 1]. 
Four-momentum conservation requires 


n+1 n+1 n+1 


z 5 kie” krie” 
0 = Kits ha i , and xy = ama (5.169) 
2 2 V ava 


as a straightforward generalization of the 2 > 2 case encountered above, cf. Eq. (5.152). 
In a fashion similar to Eq. (5.153), various Mandelstam invariants are given by 


n+1 
Laps = ` ki ikp je Uw) = ky oki no evo 94 


i,j=0 


U> 
II 


Sij = 2kikj = Qk ikLlj cosh = yj) = cos(¢; = Qj) y ki ska jel 


n+1 
fas = —2paki = -k aki je UY) a —hy obs gee ™ 
j=0 
n+1 
fs = —2pki = — X` ki aki ge œ ki akinpe I, (5.170) 
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where multi-Regge kinematics is being assumed in order to make the approximations, 
ie. a strict ordering of rapidities while the transverse momenta are all small and of 
the same size, 


yo > yı > y2 D- > Yn > Ynyı and kıa S kL Ve. (5.171) 
This leads to a strict hierarchy of scales, namely 
oS êy D> ki. (5.172) 


Assuming, as before, t-channel dominance, the momentum qı going through the 
first gluon propagator is given by 


k Yo k yı a = 
vs ( a T es 0: 0) z (knoe, kanem; Eas) 


= (krae, k1 oe™; -E1 o) - (5.173) 


Q 


qı = Pa — ko 


Therefore, the squared momentum reads 
q = Ê = kinaki oe” — kio © kio = da (5.174) 


and in this limit the scales in the propagators are driven by the transverse compo- 
nents of their four-momenta only. Of course, the same reasoning applies for the next 
propagator, q2 = Pa — ko — kı, such that, in general 


Ê =e x -4f i. (5.175) 


5.2.5.6 Real corrections: 2 — 3 gluon amplitude 


The 2 > 3 gluon scattering amplitude in the high-energy limit, cf. the left diagram in 
Fig. 5.7, is given by 


M g9-+999 


= (2igs f° a, ote) 


she 


 (—igs f2") eax 2) 4 Geguak q2 4 ki de, + gug (ki — ee 


; ade 1 
: (2igsf d ia) F , (5.176) 
2 


where the first and the last line are the by-now familiar eikonal factors for the emission 
of a soft gluon off the incident gluon line, and the second line stands for the additional 
triple-gluon vertex sandwiched between the two t-channel propagators. Multiplying 
this term with the “pending” four-momenta from the external vertices and suitably 
normalising allows to define an effective vertex, namely 
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Pa, Q, ba ko, do, Ho Pa; Q, Ha ko, do, po Pa; Q, Ha ko, do, po 


kı, dy, p1 kı, dı, 41 
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Po, b, po k2, do, 2 Po, b, Ho ky, d2, p2 Po, b, Ho k2, da, p2 


Fig. 5.7 Example Feynman diagrams for gg —> ggg scattering at lead- 
ing order. In the left diagram the t-channel exchange diagram, relevant 
in the high-energy limit, is depicted including all momentum, colour, and 
Lorentz-index assignments, Bremsstrahlung contributions from gluon emis- 
sion off the upper gluon line are also depicted. 


~ 2 
Cu = aie PS Ierta (Q1 + Q2)m + Gouri (742 + Rider + Imta (ki — Men 
tat ty 
~ (q Pte) i = “a Diui + Paai (5.177) 


where the terms proportional to the soft gluon momenta in the propagators have 
been neglected in the scalar products of the already sub-leading contributions, and it 
has been assumed that the qı1,2 are dominated by their transverse components. The 
Bremsstrahlung-like contributions from gluon emissions off the upper or lower gluon 
line, cf. the right two diagrams in Fig. 5.7 for emissions off the upper gluon line, can 
also be absorbed into the Lipatov effective vertex 


Êu 26 t 2f 
CH: (q1, q2) = (qı + q2) ( a ' 2) pp ' ( 7 Ta E) oe (5.178) 
b1 


The amplitude in the high-energy limit thus takes a particularly simple form, namely 


M 9-999 = 218 €*™ (pa)et”* (pp) (ko) (k1 Je’? (ka) 


. a C 1 . C C 1 ii a €: 
x fist K Gras A fiset 10 (q, a) a fist do Spas . (5.179) 
1 2 


The Lipatov effective vertex is manifestly gauge-invariant, as can be seen by con- 
tracting it with kı instead of the polarization vector of the gluon. This allows simple 
contraction of this effective vertex through the ordinary metric tensor for e(k1) when 
squaring the amplitude, leading to 


—+tg+t, + > 
28 ani taito1 


faite: > 5  2fib28 
CHC, = (qı H q2)4 ( ) 
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kii gati. 44i 101.9 5.180 
2 k2 7 k2 (3180) 
‘Lj ates 


x 21. 191,2 


where the relations of Eq. (5.170) have been combined into 
taitoi © ki oki ngiki eTo & ki as (5.181) 
This ultimately allows the high-energy amplitude squared to be cast into the form 


16C% gf 3 
N2—1 ki oki aki» 


Mi E (5.182) 


In fact this result is reproduced, in the limit of strong rapidity ordering, by the exact 
squared amplitude for gg —> ggg, 


TA 1 
EET ere = A(masCa)® > 8; 5 er o (5.183) 


§a08018128208 
ij  non—eycl °2090191252b5ba 


where i,j € {a,0,1,2,6} and the second sum goes over all non-cyclic permutations of 
this set [450]. 

The approximated result can also be compared with the corresponding gg > gg 
amplitude squared which reads 


4Chgs 
N2 -1 kf oki a 


[Musal = (5.184) 


cf. Eq. (5.166) and replacing f? in the denominator with k? ok7 1, following the logic 
of the high-energy approximation. This gives rise to the — correct — suspicion that 
additional gluons emitted along the t-channel ladder will lead to factors 4C4g?/kf ; 
and thereby to logarithmic corrections of the type log(8/t;). 

A second comment is in order here. Inspecting the three-body phase-space element, 
written in convenient coordinates, 


5 dyjd2ky i 2 
_ i E ; 44 2 y 
dbs = i ati ays (rem Se) 


i=0 


1 dki o dyd?k, 1 dki Safes 
= 35 Qa)? aay Gaye aS (>: hs) ete) 
it becomes apparent that there is not explicit rapidity dependence on the emission of 
the extra gluon; relieving the strong-ordering condition it will be emitted with a flat 
probability anywhere between the two most forward gluons, 0 and 2. This indepen- 
dence will remain also for further gluons, with the only constraint imposed by rapidity 
ordering; other than that, the probabilities for gluon emissions will be flat over rapidi- 
ties. This maybe surprising pattern will change when sub-leading corrections are taken 
into account, a complication well beyond the scope of the discussion here. Following 
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the findings until now, the differential partonic cross-section is given by 


I g9+999 Chae [lyo — y2] 
2 2 > 
dki odki ado 4m k3 ok? a (K3 o +k? o+ 2ka ok1,2cos goz) 


, (5.186) 


where |yo — y2| is the rapidity interval gluon 1 is allowed to populate, and ¢o2 is the 
azimuthal angle between the gluons 0 and 2. With rapidities given by log(ŝ/k? ) the 
logarithmic enhancement of the production cross-section of three gluons with respect 
to the production of two gluons at the same rapidities yo and y2 becomes manifest. 
This provides motivation for trying to resum this new class of large logarithms, which 
have their origin in rapidities (or as log(1/z)) in contrast to the usual collinear double 
and single logarithms that are resummed by the DGLAP equation. 


5.2.5.7 Virtual corrections 


However, in order to fully appreciate the emerging logarithmic structure, virtual cor- 
rections need to be considered, too. In the large-rapidity limit, the relevant corrections 
are depicted in Fig. 5.8. 


Pay, Ha Pa +l, a' ko, do, po Pa, Q, Ha ko, do, Ho 


Po, b, po —p +10! kı, di, oa 


Fig. 5.8 Feynman diagrams for virtual corrections to gg —> gg scattering 
in the high-energy limit. In the left diagram all momentum, colour, and 
Lorentz-index assignments have been included. The result for the crossed 
diagram on the right can be obtained from the one for the left diagram 
with suitable replacements. 


Omitting from the start contributions of the form e(p) - p, approximating Pa ~ ko 
and p © kı, and realising that the dominant contributions stems from small loop 
momenta, which allows us to neglect terms œx l, the amplitude can be cast into 


M = Epas (Pa) Eup» (Pb) Eno (Ko) En: (k1) 
4 4 faa'c fa'doc' ¢cb’b fe'dıb' 
fe Pr 7 


27)* | (pa +171? (pp — 1)? (q — 1)? Fae Sas Poa Seg oi 
x | (zane — Qgtta! Malt [oa 4. gat Ha (| — pe") 


x (Ga (Da + ko +1) + gH} (pa — 2ko Lhe = aghorto he] 
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x (om (l = 2pp)he” + greo" (pp + Drv = Qgho Bo! m | 


x [(a (pa — ko +L kt” pid- )] | 


X Ena (pa) Epy* (pv) Epo (ko) Eui (kı) 


d4] gt fare fo doc’ fobb fe'dıb' 
i i (r)i 


tipt 


(Da +IPP(pp -P(g Ireo Inani Irini Indui 


x | (zat a ghortapte')| (2am pner = goror pte )| 


x (G ee 2gimnhe )| (Gas = agrs pg )| ) 
— gafe epa doe Fok he aie . T3 (5.187) 


where in the last step the decomposition 


Pu,aPv,b + Pu,bPv,a 


Juv = 2 O (5.188) 


has been used and where the remaining integral Z is given by 


L= 


s f d4! | 1 1 1 1 | 
S . . . 
(274) | (pa +1)? P (p-t)? (4-1)? 
age f dodpd?ku 1 1 
(27)4 (1+a)b-— k? +ie aps- k? +ie 


II 


1 1 
“a(b —1)8— k? Fie ia a i 
(5.189) 


To obtain the expression in the second line, the loop momentum / has been decomposed 
according to 


IH = apt + Bp! +h and dfl = sdodsd*ly . (5.190) 


In the final expression for Z above, the actual pole structure of the propagators has 
been made explicit, since this allows the analytic structure to be analysed and ulti- 
mately the integrations to be performed. This is more or less straightforward, since 
closer inspection reveals that the a integration of the integral in Eq. (5.189) is not too 
hard: all propagators apart from the third one will have a pole in the lower complex 
half-plane of a and the a integration may easily be performed in the upper half-plane: 


d8d?k, | 1 1 1 


T x —2i8? . . ; 
| Qn |B- k tie B+ie (q ki) tie 


(5.191) 
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leaving a logarithmic integral over 6 in the range 8 € [k7 /8, 1], resulting in 


Mx g FO g OPI Se (Pa) Eux (Pe) Euo (ko) Eu (kı) 
8 A 


1 s 1 $ 1 , 1 1 s 
6ra 3 fer epe doc fe Pye dib . a2 log a a(t) f (5.192) 
Ca —t —t 


x 


with the characteristic function 
dki 1 
(27)? kî (q -k)i 


which encodes the residual loop integration in the transverse plane. This function 
could be regularized through an infrared cut-off u, resulting in 


ae Sa J (5.193) 


2 
a(é) 2 — 274 ogh, (5.194) 
4r L? 
showing that the amplitude is doubly logarithmic-divergent. 

However, the second, crossed diagram in Fig. 5.8 needs to be considered as well. 
This is relatively straightforward, since the dominant term can be obtained by just 
replacing ê — é, and using that û = —ŝ — Ê ~ —ŝ in the usual limit ê > 0. The 
overall amplitude thus reads 


l6Tas 8 f aa'c ga'doc’ 
i ~ Ca ge) g” gP epay Eup * Euo Em f f a 
x |log = ‘ie bale (x = m in) foe ad . (6.195) 


It is interesting to note, though, that in the calculation here, self-energy and vertex- 
correction diagrams have been omitted: their dominant contribution would consist of 
a running of as, which, of course, could be inserted “by hand” as well. 

However, this leaves some colour algebra as a final task. In the final step for the 
virtual amplitude, the leading contribution corresponding to one single simple colour 
structure will be extracted. To this end, it is useful to recall that the underlying 
problem is the determination of colour structures in the scattering of two gluons, 
of 8 ® 8 in the adjoint representation of QCD. It is important to remember that 
the generators of the adjoint representation, 8, indeed are anti-symmetric. Broadly 
speaking, therefore, two kinds of structures emerge, symmetric and asymmetric ones, 


8 @8 = [88 Big + [8 @8], , (5.196) 
given by 


[8 88]; = 198s 6 27 
[8@ 8], = 84 010010. (5.197) 
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The products of structure constants f% of the results are mapped onto these struc- 
tures through projectors Pee, given by 


Da 1 a 
Pat (1) = N2—1 ô D Spa, 
Da 1 ac 
Pa (8A) = wif do feed 
x — 1 1 
PEs (10 @ 10) = 5 (80%, ) — gy FFs» (5.198) 


while the other symmetric structures do not contribute any logarithmically enhanced 
terms, since the colour structures between s- and u-channel are antisymmetric and 
the two contributions differ only by finite terms. Furthermore, the projector for 10 $ 
10 cancels exactly when contracted with the colour structures in the result above, 
Eq. (5.195). This leaves, at leading colour, only the anti-symmetric octet contribution, 
thereby effectively reducing the result to 


S log = gitato grem fedo foed . (5.199) 


M ~ —8ras a(t) 7 


5.2.5.8 Putting it all together: the 2 —> n gluon amplitude 


The structure of the leading terms of the 2 — 3 gluon amplitude in multi-Regge 
kinematics, Eq. (5.179), lends itself to a generalization to the 2 — n gluon case in 
the same kinematical situation, as advocated in Ref. [708]. This form can be further 
supplemented with the leading higher-order virtual corrections from above, assum- 
ing that the dominant octet structure encountered there also constitutes the leading 
terms for the 2 > n gluon amplitude. From a Lorentz and colour structure point of 
view the virtual corrections look exactly like the exchange of a single t-channel gluon, 
which motivated the conjecture that exponentiating this correction would constitute 
the all-orders leading logarithmic approximation to a t-channel gluon exchange [728]. 
Applying this to all t-channel gluons amounts to replacing their simple propagators 
with 


1 1 er cee | A 
i, > i, g ( i, ) oo i, exp fot w| ’ (5.200) 
using the approximated expressions in Eq. (5.170). 

Recalling the form of the function a(t), the exponential on the right hand side of 
Eq. (5.200) has a remarkable similarity to a Sudakov form factor driven by the product 
of two logarithmic structures. First, there is a collinear logarithm, which manifests 
itself as a logarithm of the cut-off u, log(q7 /u?). Sending this cut-off to zero would of 
course lead to the function diverging, and, as in similar situations before, ~ must be 
interpreted as a resolution parameter. Without any limit on resolution, the probability 
for not emitting a parton is zero. The other logarithm is written as rapidity differences 
(yi-1 — yi) which in fact are nothing but log(§;_1,;/ t;). These terms are different from 
the ones encountered before and emerge as large contributions only in the high-energy 
limit. 
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Assembling all contributions into one expression, the 2 > (n + 2) gluon amplitude 
in the high-energy limit reads 


M gg (n+2)9 = 218 int Guar e@* (pq eH” (ko) 


x E exp fedt — yı) 


x [iseset Ca (a, TEA 


x 5 exp feon — y2) 


e”? (k2) 


bs Es Ds LP 
om 


x int" C (q2, q3) 


1 a 
TE fetio Ss ims) 
tn41 


x C €M>* (pp) t! (kn41). (5.201) 


Apart from the exponential factors attached to the t-channel gluon propagators re- 
summing the leading virtual correction to all orders, this constitutes a straightforward 
generalization of the 2 > 3 gluon amplitude in Eq. (5.179). 


5.2.5.9 Evaluating the discontinuity 


In the next step the discontinuity of the elastic amplitude has to be evaluated. In this 
setup this translates to applying a cut on the s-channel gluons k, equivalent to putting 
them onto their mass-shell, i.e., to replace 


a 
ao 276(k?). (5.202) 
Thereby the (n + 1)-loop integral is effectively replaced with an integral over the 
(n + 2)-particle phase space: 


ee i / dík; TI eost) E TI dy; deki ya ya58 as 
n+2 — ne (27)4 cs J a= + Ar (2r)? P+Pb a i i 
(5.203) 
Being interested in the amplitude in the multi-regge regime, this can be recast into a 


form, where the conservation of longitudinal light-cone momentum is guaranteed by 
the two most forward outgoing gluons, 
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n+1 


1 @ 5 dyi dki, | dkn 
Pnz = | oi we ITT f S24 = me one 2m) ao tn) . (5.204) 


To obtain the discontinuity of the elastic gluon scattering amplitude with the exchange 
of two t-channel gluons, one must sum over all 2 > (n + 2) gluon amplitudes in 
Eq. (5.201), which ultimately translates into 


DiscjiM®?? 


E a 
7 -Sfi 1 a dyi hit | dkna (9 “(Yo 
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exp [a(Ens1)(Yn — Yn) | 7 — exp [a(s41)(Yn — mt) 
| |r| | 


tn+1 n+1 


x (igs JPH +) (igg fortran) , (5.205) 
where t = q? and the individual t; = (q — q)?. 


The products of Lipatov vertices C„C” have already been obtained in Eq. (5.180). 
Here they are generalized to 


(q—ait gti 1 + (4 qiz) a? 
C (qi, G41) Cus (9 — G9 — G41) = —2 | qf a 13 
qi — G+41)4, 


Projecting on colours — octet vs. singlet exchange along the two-gluon ladder — yields 
colour factors C, 


es ee for singlet (5.207) 


N,/2 for octet 
such that in total 


n+1 
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n+1 
1 
evk- 1—yr]la(fk)+a (Ê 
x AAC 2K( m, dm 
lag J ji (dm, q «| 


m=1 


(5.208) 


At this point the Laplace transform prepared in Eqs. (5.142) and (5.144) allows to 
disentangle the multiple integrals in a very convenient way. This is achieved by 


1. assuming a strong ordering of the rapidities, y; >> yi41, thereby enforcing the 
kinematical situation of the high-energy limit, cf. Eq. (5.171), 

2. integrating over rapidity differences y; — y;+1 instead of rapidities, and 

3. using the overall rapidity difference yo — yn+1 as a Laplace parameter. 


Then, the Laplace transform reads 


et 1 

x — = —~ (—2asC)K (qj, 4541) 
o z 

1 1 

x a = = ; (5.209) 

tntitng1 L> 1—a(tn41) — oth 41) 
which can be cast into 
5 ~ f Pare 1 
=? 3 a! i .21 
FA 2i(4rasC) Gn? @ ane fila, t), (5.210) 


where the function f, is a solution of the integral equation 


1 d?qa1 Km, 4) 
t) = a 1— 2a,C ; 
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95,1 (4 — 42)4, 
(5.211) 
This is the celebrated BFKL equation. 


5.2.5.10 Octet solution of the BFKL equation 


4 a first step, the octet solution of the BFKL equation is considered. Looking at 
q. (5.195) and taking into account that for symmetric octet exchange, the amplitude 
is ae under § <—+ @ implies that for this case a(f) = 0. Therefore 


A d?k 4 
(1-1) f(a, Ê = 1- aN | = BG p(k, t) (5.212) 


which yields 
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oc „Ê — foct k, ê = _ 
(a1) = #69 = a 


; (5.213) 
where the term a(¢) in the denominator stems from the integral. Pushing this through 
the Laplace transform, the amplitude for octet exchange is duly obtained, 


Met, (8, Ê) = 4na,N, a (1 +e) ay" (5.214) 


eae “sin[ra(t —t 


The exponent stems from the pole at 1 = 1+ a(t) in the complex angular momentum 
plane, which essentially yields the spin of the exchanged object. In Section 7.1.2 objects 
exchanged in the t-channel will be dubbed “reggeons”, and the exponent indicates 
their “equivalent” spin — essentially the intrinsic angular momentum related to their 
exchange which, of course, does not need to be half-integer or integer. Using the fact 
that a(t) = 0 for Ê = 0 shows that the reggeized gluon indeed has spin 1. In the high- 
energy limit, where f is small compared to 8 the amplitude thus can be approximated 
as 


Me, (8, Â = —8ragNe = vow) (5.215) 


as anticipated. 

It is worth noting that although having started the consideration with the discon- 
tinuity at O (a2), the overall result for the elastic amplitude is O (a2). This reflects 
the fact that of course the colour-octet exchange amplitude starts with a single gluon 
in the t-channel, which is at this very same perturbative order. 


5.2.5.11 Singlet solution of the BFKL equation — the perturbative 
pomeron 


After some substitutions, the BFKL equation for the exchange of a colour singlet at 
t = 0 is given by 


(1-1) a k) = a ZRI 


q? 2- psin. Gi, ale psin. 
+4a,Ne [Ss 8 ; k $ ’ k , 
Ga a i (qo, k) g F Gee (2). 1 (1; k) 
(5.216) 
where ff 88 (q,, k) is the differential form of the singlet function for ¢ = 0, 
sin, A dki Saini a 
rds t= 0) = J Qn)? T S(q, k,t=0). (5.217) 


The homogeneous part of the equation can be solved after a Fourier expansion, 
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tale (q, k 2. J dv a( (v, n) exp iv (108 — log = k? =) + in (o -4)| , 


n=—0o* 

(5.218) 
where (¢, — ¢) is the angle between qı and k in the transverse plane and yp is an 
arbitrary scale. Expanding the 6 function in a similar way results in 


di k2 i 
p aS (Qn)? DS J dv exp iv (o$ 5 — log 5) + in (o: -4)] ; 


nN=—OCO_ 


(5.219) 


i.e. the same expansion as for the homogeneous part, but with constant coefficients 
a(v, n) = 1/(47r?°k1qı,1). Inserting this expansion into the equation for f yields an 
equation for the coefficients, namely 


1 
i-1 ee ; 
( Ja(v, n) GE + w(y, nja(v, n), (5.220) 
and therefore 
(iH=—— i (5.221) 
Nae ~ Ankig 1 l-1—w(y, n)’ i 
After a bit of algebra, the eigenvalues w(v, n) can be written as 
2as Ne De 
wln, n) = -L Re Ẹ (ee siv) = vo) ; (5.222) 
T 
Here y(x) is the derivative of the logarithm of the T-function, 
dlog I(x) 
= 22 
p(x) Jz (5.223) 


The exchange of such a colour-singlet object in the t-channel is often identified 
with the exchange of a pomeron, which in turn is the driver of the total cross-section, 
cf. Sections 7.1.2 and 7.1.3. However, while in the simplistic picture advocated in 
this later part of the book, the pomeron is assumed to be a simple pole, this is not 
the case in perturbative QCD, studied here. Here, it is important to stress that the 
eigenvalues are continuous, which implies that for the singlet solution the idea of a 
simple ¢-dependent pole must be abandoned. The perturbative pomeron is a branch 
cut in the complex angular momentum plane. 

However, in order to analyse the behaviour of this structure in more detail, consider 
the leading contribution, which is located at v = n = 0. Expanding w for small v and 
n = Q yields 
(2log2 — 7¢(3)v? +...) & EN eIOE? x 2.65as. (5.224) 
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The leading contribution to fi therefore reads 


k 1 T PITA k? 
k ~ l = .22 
filka, ko) ENB- A oo | g 8 Re | (5.225) 
ae AN, log 2 14¢(3)N 
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5.2.5.12 Total gluon-gluon cross-section 


Undoing all the steps until now, Laplace transform, relation between discontinuity 
of the amplitude and total cross-section, etc., then results in the total gluon-gluon 
scattering cross-section in multi-Regge kinematics 


with 
fS (ka, kp, y) 
k2 
= ves À Sjo exp fa v, n)y + iv log By + in(ea = $0) | ; 
(5.228) 


and where ka and kp denote the two outermost, i.e. most forward, gluons. The pref- 
actor accounts for the averaging over incident colours and spins. This is schematically 
depicted in Fig. 5.9. 

Integrating over the azimuthal angles means that only the n = 0 term contributes, 
and thus 


(tot) 2.2 R 2 
N ke 
nilar ETS cas / dv exp ee n = 0)|ya — yo| + iv log a 


dk? dk. Ake kB E kê 
N? am 1 1 2 k? 
x cus exp | Alya — Yl log a 4, 
4k? kB 4/Blya — yl 4B\ Ya — Yo ke 
(5.229) 


As before, the leading singularity in the complex /-plane is given by l = 1+ A, leading 
to the rise of the total partonic cross-section with 84, in violation of the Froissart 
bound. This rise is encoded in the first term of the exponential, after realising that 
the rapidity difference |y, — y| is proportional to log. The critical exponent A is 
about A ~ 2.5a, at leading order, which reduces by about a factor of two at higher 
orders [518]. Another feature to note is that the form here includes a Gaussian in 
log(k? | /k? |), with the peak at balanced transverse momenta, ka = ky, and a 
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Fig. 5.9 Sketch of the scattering process gg — gg to all orders. The dou- 
ble-lined box represents the exchange of a singlet gluon ladder, indicated 
by a few gluons and some dotted lines hinting at further gluon, giving 
rise to the function f*"8(ka, ko, y) in Eq. (5.227). The vertical dotted line 
shows that this is actually the discontinuity of the amplitude. 


width growing with the rapidity distance of the two. This is not too surprising since 
the BFKL equation could also cast into the form of a diffusion equation with the 
diffusion rate given by log(k? | /k? 1). 

Performing the integration over transverse momenta with a cutoff ka,1, kot > PL 
ultimately yields 


22/2) F w(v,n=0)|ya— 
Glki>p.) — Neos (Pi) I WE ( yao 


99>99 Ap? p2 +4 
AN-ds 7 eS log 2 
ae exp ( (pi )[Ya — yel log ) 
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2p. EO — yol 
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This exposes the exponential growth of the cross-section with the rapidity distance 
of the two most forward partons, synonymous with a power-like growth in s. Taking 
[Ya — y| = log(8/p? ) allows the determination of the resulting approximate K-factor 
as a function of the parton centre-of-mass energy, and for different values of p,. Simple 
inspection shows that the K-factor increases steongly with the logarithm of §/p? , as 
expected. The Mueller—Navelet jets [771] aim at probing exactly this regime by fixing 
the x and vary the rapidities of the two most forward jets and by then measuring the 
dijet cross-section. This implies that the hadron centre-of-mass energy scales with the 
partonic one, i.e. with the exponential of |ya — yo]. 
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The azimuthal correlation of the two most forward jets is another interesting fea- 
ture of the cross-section in the BFKL approximation. In the Born approximation for 
gg — gg, the two gluons are back-to-back in transverse space with exactly the same 
transverse momentum, i.e. kai = ky. and |¢a — o| = 7. This picture changes when 
higher-order corrections in the BFKL approximation are taken into account. Since 
there the leading term is given for n = 0, naively these two jets become entirely 
decorrelated. In addition, due to the diffusion property of the BFKL equation, also 
the correlation between their two transverse momenta fades away as their rapidity 
distance increases. This, the decorrelation of forward jets at large rapidity distances in 
transverse space, actually constitutes a possible test for the onset of BFKL dynamics. 


5.3 Parton shower simulations 


In this section, the simulation of QCD radiation in the production of strongly inter- 
acting particles at large scales through the parton shower picture will be introduced. 
Starting from an alternative, quite pictorial, introduction of the Sudakov form factor 
underlying this framework, details and issues of concrete implementations of this idea 
will be presented, and their connection to hard matrix elements at Born level will be 
discussed. 


5.3.1 Underlying ideas 
5.3.1.1 The Sudakov form factor: A simple example 


In order to re-introduce the Sudakov form factor as the driving term of a probabilistic 
simulation of multiple parton radiation, a detour will be taken. Consider, as a toy 
model, the decay of a radioactive isotope with half-life r. Ignoring trivial factors, the 
probability that starting with an intact nucleus at t = 0 the isotope is still intact at 
time t is given by 


prodec. (4, 0) = exp -+ = exp |-Tł] . (5.231) 


T is related to the decay width of the isotope through I = 1/7. Of course, the proba- 
bility that the isotope has decayed therefore is 


P(t, 0) = 1 — Pmt (t, 0) = 1 — exp [Ti] . (5.232) 


This is a very simple example of unitarity, written in the form of probability conser- 
vation: the isotope has either decayed or not. 

Differentiating the decay probability thus yields the probability density that the 
isotope decays exactly at time t: 


apaec- (t, 0) aprodec: (t, 0) 


_ = an = ; nodec. 
T = T: =T exp|-rt =r. P (t, 0). (5.233) 


To make things a bit more interesting, consider a case, where for some reason the 
decay width is time-dependent, [ —+ T(t). In this case, Eq. (5.231)) becomes 
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t 
prodee- (t, 0) = exp | — J A'T), (5.234) 
0 


and, consequently, the probability density of Eq. (5.233) is replaced with 


dP aes, 0) qp»odee: (t, 0) nodec. 

T = EP =T(t)-P (t, 0). (5.235) 
The no-decay probability P4° (t, 0) of course is nothing but the Sudakov form factor 
A(t, te) encountered for the first time in the framework of resummation, Eq. (2.151) 
in Section 2.3.1, where multi-photon radiation has been discussed and the Sudakov 
form factor ©(Q? ) was also interpreted as a probability, just like here. 

The question remains how these equations can be turned into a numerical simula- 
tion. The answer, on the face of it, is obvious: since the Sudakov form factor represents 
a probability for the isotope not to decay in some time interval, it should result in a 
number between 0 and 1. Due to its form as the exponential of a non-positive number, 
this is trivially fulfilled. 


5.3.1.2 Connection with single parton splitting 


The previous discussion can now be directly extended to the case of parton branching 
for partons of type a € {q, g}, through suitably replacing the time-dependent decay 
width I(t) of the isotope with the integrated splitting function, ['.(T, t), previously 
introduced in Eq. (2.178), times the strong coupling and a propagator factor: 


S Ratt st 
ra — SiD (5.236) 
T t 
such that the Sudakov form factor for parton splitting takes the form 
dt’ as 
A(T, t) = exp -f Zera t’)| . (5.237) 


This has already been encountered in Section 2.3.2 and in Section 2.3.3. The notable 
difference with respect to the toy example of the radioactive decay is the fact that the 
time evolution from a start time tg = 0 to the decay time tae, has been replaced with 
an evolution from a large scale T to a smaller scale t. It is no coincidence that this is 
identical to the form of the Sudakov form factor in Qr resummation, as the integrated 
splitting functions have identical coefficients A and B® at leading order. 


5.3.1.3 Infrared cut-off, virtual contributions, and unitarity 


Taking a closer look at the integrand of the Sudakov form factor, essentially given by 
I(T, t')/t’ ~ log(T/t’)/t’ indicates that the available scales for the parton splitting 
should be bounded from below in order to guarantee the convergence of the integral 
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over the scale. In other words, introducing an infrared cut-off te, typically of the 
order of a few Agop, the additional constraint 


tU>t.>0 (5.238) 


must be satisfied. For any given large starting scale T, and for a scale-independent 
strong coupling, there is the probability 


CF AQs FE s T 
Ag APs te) = 1 exp | (o i Ao) log 3) 


CF, AQs T = T 
= (o 7 Ao) log =) +0 (a2) 


T 


(5.239) 


for a quark or gluon to emit a gluon above the resolution scale te, which increases with 
increasing T/te. Comparing this result with the Sudakov form factor of Qr resumma- 
tion, Eq. (2.166), the terms A® = Cyg and BY = —4{) = —A”) 4 are readily 
identified. The naive mismatch by a factor of two between the two formulations, par- 
ton showers and analytic resummation stems from the fact that in the former each 
emitter is treated individually while in the latter, and in particular in the discussion 
in Sections 2.3.2 and 5.2.1 the case of two incoming partons of the same type — quark 
or gluon — fusing into a colourless state has been considered. The corresponding com- 
bined Sudakov form factors in this case account for both of the incoming partons, 
explaining the relative factor of two. Note also that terms of higher order in as have 
been omitted here, although the term equivalent to A®) is easily included as well. 

The probability not to emit any resolvable gluon decreases with increasing ratio 
T/te, 


C AAs T he T 
Aq,g(T, te) = exp | = (o F A) log )| 


z T 
CF AQ (oe 


T 


(5.240) 
=1 


at O(as). For large ratios T/te, the emission term ~ a, can become larger than unity, 
which leads to this expression turning negative at O (as). In such a case the higher-order 
terms encoded in the exponentiation will also increase and thereby guarantee that 
the exponent of the Sudakov form factor remains negative, rescuing the probabilistic 
interpretation. 

Due to its probabilistic nature, the Sudakov form factor incorporates real, resolv- 
able emissions as well as unresolvable ones, although only the former ones appear 
explicitly as the driving term. The unresolvable ones are taken into account only by 
the introduction of the cut-off and the interpretation of the Sudakov form factor. Dia- 
grammatically speaking, such unresolvable emissions can be attributed to either the 
soft and/or collinear real gluon radiation at scales below the cut-off te, or to the virtual 
diagrams, see Fig. 5.10 for a pictorial representation. Since both exhibit infrared di- 
vergences which cancel each other, and because the expression in Eq. (5.240) is free of 
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Fig. 5.10 Diagrammatic representation of the cancellation of divergences 


in unresolvable and virtual corrections and of large logarithms in their sum 
and the resolvable emissions. Here, the case of an emitting quark is shown. 


pole terms 1/e, it becomes apparent that the Sudakov form factor resums all large log- 
arithms of the type log(T/te) at leading order, stemming from unresolvable and virtual 
diagrams, or, conversely, the same logarithms originating from the resolvable emissions. 
This is nothing but the celebrated Kinoshita-Lee-Nauenberg theorem [678, 724], for- 
mulated in a probabilistic way. In this fashion the unitarity-conserving character of 
the parton shower manifests itself: by employing a probabilistic picture virtual and 
unresolvable emissions are inherently accounted for. 

Fixing the problems related to the divergent low-scale behaviour through the intro- 
duction of the infrared cut-off te, the actual scale of this cut-off needs to be discussed. 
There are two considerations driving this choice. For one, a well-defined description 
of radiation in a well-understood perturbative framework with only two parameters 
(ag and te) is preferable over a description resting on phenomenological models with 
many parameters, which, at best, are understood only in qualitative terms. Such phe- 
nomenological models describing the emergence of hadrons at the end of the parton 
shower are discussed in more detail in the section introducing the ideas underlying 
hadronization models and their implementation in Section 7.3. This line of reasoning 
drives the infrared cut-off to minimally small scales. On the other hand, due to its 
confinement property, the perturbative description of strong interaction processes will 
eventually break down. This break-down of perturbation theory typically is related to 
scales of the order of Agcp. Combining both considerations motivates choices for te 
of the order of about 1 GeV”. 

On a similar note, also the hard scale T must be fixed. In previous sections dis- 
cussing analytic resummation techniques, it became clear that this choice is quite 
important, as it also drives the size of the Sudakov logarithms. Bu there is no recipe 
based on first principles or a universal scale choice: it is process-dependent. While for 
simple topologies as in Drell-Yan-like processes the choice is fairly straightforward, 
something of the order of the invariant mass of the colour-singlet system, more intri- 
cate colour topologies will have to be described by more complicated choices. As a 
rule of thumb one could argue that the typical scale related to the distortion of the 
colour flow presents itself as a reasonable choice. It can be argued that indeed, inside 
the parton shower, such a scale choice is the most advantageous, as it resums certain 
classes of sub-leading logarithms. 
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5.3.1.4 Simulating a single parton branching 


The determination of the full parton decay kinematics involves parameters beyond 
the scale related to the decay. First of all, a variable z is necessary, parameterizing 
the splitting of energy or, theoretically better motivated, light-cone momentum in the 
branching process a — bc. For the simulation of parton splitting, the integrated 


splitting function T'.45-(t, te) to first order in as must be replaced by the differential 
one PY (2 a= PY 2), where the exact parton composition of the splitting process 
has been made explicit. In addition, an azimuth orientation @ must be included. In 
the single emission terms forming the Sudakov form factor of analytic resummation 
this parameter has been inconsequential and it appeared only as a constraint in the 
overall four-momentum conservation. 

Taken together, the Sudakov form factor suitable for the simulation of a — bc 
becomes 


Aagsbe(T,t) = exp -/ fe [gaeta (a POl. (5.241) 


Here, the limits of the z-integration, z+, emerge from the need to reconstruct the 
kinematics of the splitting obeying exact four-momentum conservation. They therefore 
depend on the specifics of the scale choice, etc., which will be discussed in more detail 
below. It is worth stressing that the introduction of an infrared cut-off of the parton 
shower will guarantee that these limits cut out the divergences in the splitting function. 


Another subtlety arises when actually constructing the decay kinematics param- 
eterized by {t, z, dé}. A massless particle on its mass-shell in general cannot decay 
into two other massless particles, due to four-momentum conservation. This translates 
into the necessity to involve other partons, which will “donate” some four-momentum, 
absorb the recoil of the decay and thus guarantee local four-momentum conservation. 
This role is typically filled by the colour partner of the decaying parton. The resulting 
reshuffling of momenta of course is very minor in the case of soft and collinear split- 
tings and vanishes in the limit where the invariant mass of the produced two-body 
system approaches the invariant mass of the decaying parton. Therefore, the details of 
the momentum shuffling are beyond the intrinsic accuracy of the parton shower. For 
emissions away from the soft and collinear regions they start to become increasingly 
important. In order to characterize a parton shower, it is thus necessary not only to 
define the interpretation of the parameters t, z, and ¢, but also to define precisely how 
four-momentum conservation is achieved. In the next section, Section 5.3.2, the actual 
construction of parton showers will be exemplified through some typical realizations. 


5.3.1.5 Initial-state parton showering 


A new problem manifests itself when simulating multiple emissions off initial state 
partons through the parton shower. This is due to the simple observation that, in 
marked difference to the radiation off final state particles, in initial-state radiation 
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both the incoming hadrons at the low scale and the parton entering the hard interac- 
tion at the high scale are fixed. Of course, to circumvent this problem, a naive idea 
would be to just start the parton shower evolution from both hadrons and arrive at 
a hard scatter. A quick glance at various cross-sections relevant for phenomenology 
reveals that such a procedure would be prohibitively inefficient, since all interesting 
processes have cross-sections that are many orders of magnitude smaller than the 
inelastic hadronic cross-sections. Therefore, in order to arrive at any statistically sig- 
nificant sample of simulated events it is unavoidable that the hard interaction is fixed 
first and only subsequently dressed with multiple parton emissions and further stages 
of event simulation. 

In final-state radiation, all external hadrons in the final state are equally permis- 
sible, while in the case of initial-state radiation the incoming hadron of course is fixed 
by the collider setup. Since the parton shower evolution preferably is described as an 
evolution from the large to the small scales, the forward evolution in the simulation 
of final-state radiation thus becomes a backward evolution for emissions off the 
initial state. 

The theoretical motivation is as follows [851]. The parton distribution function 
entering the calculation of the hard process cross-section, cf. Eq. (2.52), already em- 
bodies an inclusive summation over all possible initial-state showers starting from a low 
hadronic scale and arriving at the hard factorization scale up. The scaling behaviour 
of these PDFs is given by the DGLAP evolution equations, Eq. (2.31). Schematically, 


d Ds 
hale = oe ee =) fanla’, t). (5.242) 


This needs to be turned into an expression for the probability of parton b disappearing 
from x during a small decrease of scale, dt. A simple way of achieving this is to 
divide the equation above by fy/n(a, t). To first order in as this leads to the following 
expression for the parton decay probability: 


— Afeyn(@,t) dt a da’ 5, fanla, t) 
Aa fole, t) t a (5) eE (5.243) 


Exponentiating this expression for the individual splitting, as before, yields a Sudakov 
form factor, this time for backward evolution. It encodes the probability for a process 
not to occur, where a parton b at momentum fraction x in the initial state is replaced 
by a parton a at 2’ = x/z under emission of parton c. 


t d d X (t, R zy 
aott = opf- f E fe aame po o hale 
fojn (æ, t') 


(5.244) 


The only visible difference to the forward evolution is that here also ratios of PDFs 
enter. Their role is to assure that, starting from a hard scale, one actually arrives 
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Fig. 5.11 Diagrams which are effectively resummed in analytic calcula- 
tions as well as in parton showers by choosing the scale in the strong: 
coupling to be ~ k? . For simplicity, typically as(k? ) is assumed. 


at the “right” initial hadron, and that on the way to this lower scale emissions are 
unfolded respecting the DGLAP evolution equation. A simple way to see that this 
is indeed the case is by realizing that for every splitting to take place, the PDF for 
parton b at its values of x, and t, is replaced by the PDF for the parton a at lower 
scale ta < ty and larger £a > x, as encoded in the ratio of the PDFs in Eq. (5.244). As 
a further welcome consequence of this backward evolution, momentum conservation 
is trivially guaranteed, i.e. no x’ > 1 can be chosen — the PDFs just vanish there. 
This effectively constrains the lower limit of the z integral to be larger than x: z_ > a. 
In addition, the flavour symmetry of the PDFs, in principle treating all (massless) 
quarks on the same footing is broken. For instance, moving towards larger z’ and 
smaller scales t, the flavour-symmetric sea components of the quark PDFs vanish and 
the flavour-unsymmetric valence contributions emerge. 


5.3.1.6 Quantum corrections: Running of a, 


In Section 2.3.3 suitable evolution parameter definitions, related to either (scaled) an- 
gles or transverse momenta, were discussed to guarantee the resummation of leading 
and next-to-leading logarithms in the description of multiple emissions in the final 
state. In addition, the running of a, demands a specification of the scale at which 
it is to be evaluated; as argued before in order to resum next-to-leading logarithms 
the correct scale is given by the transverse momentum of the splitting process. This 
includes a certain class of quantum corrections to the treatment of the individual 
emissions, see Fig. 5.11. To go even a step further, universal higher-order correc- 
tions may be added to the universal soft gluon term, which effectively amounts to 
adding a term ~ a? to the splitting kernel. Comparing with the A?) terms in analytic 
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resummation, cf. Eq. (5.65), motivates the replacement 


aul) _, alt) , a les (% =) = (5.245) 


also in the soft terms of the parton showers, see below 

This choice of scale in the strong coupling poses an additional constraint on the 
infrared cut-off te. It necessitates the minimal transverse momentum in a parton split- 
ting, kmin, to be sufficiently above the QCD scale Agen in order to avoid the Landau 
pole of as. Typically, the infrared cut-off of the parton shower is a parameter to be 
adjusted to data and in the region of 


kim ~ 1GeV > Agocp. (5.246) 
5.3.1.7 Quantum corrections: Angular ordering 


In Section 5.1.1 the notion of angular ordering of subsequent emissions has been 
identified as an important feature of the QCD radiation pattern beyond the naive 
double-leading logarithmic approximation. 

For parton shower simulations angular ordering implies that the exact choice of the 
evolution parameter t is essential for the formal accuracy in terms of leading and sub- 
leading logarithms. In the past, different choices for the form of the ordering parameter 
t have been implemented and formed the basis of parton showers successfully describing 
data. These choices include the following properties of the produced pair: 


e invariant mass, t = Q?, employed in the very first parton shower realizations [216, 
533, 593, 766, 788], 


e opening angle, t = 6”, taking into account quantum coherence effects [579, 750], 


e and transverse momentum t = pf , similarly including coherence effects [603, 729, 
731, 778, 805, 856], 


While all three choices exhibit logarithmic behaviour in the limit of small opening 
angles, the exact form of the logarithms differs for these three choices; in particular it 
was found that only the latter two systematically incorporate the effects of quantum 
coherence. 

The effect of not taking into account such effects in hadronic collisions was high- 
lighted through an analysis performed by the CDF collaboration during Run I of 
the TEVATRON. The analysis studied the angular distribution of a third jet in QCD 
events [111], cf. Fig. 5.12, where its pseudo-rapidity distribution is depicted. The jets 
were defined by the midpoint algorithm with a radius of R = 0.4 and a minimal trans- 
verse momentum Be > 10 GeV in a pseudo-rapidity region given by |7e)| < 2. In 
addition, the first (hardest) jet was demanded to have at least a transverse momentum 
of pie) > 110 GeV. A number of observables are sensitive to angular ordering (QCD 
coherence) effects, the most intuitive ones being the 7-distribution of the third jet and 
its spatial distance R in the 7-¢ plane from the second jet. 

Both can directly be related to angular ordering, when considering the colour flow 
in typical jet events in hadron collisions, cf. Fig. 5.13. The underlying hard process can 
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Fig. 5.12 The effect of colour coherence in QCD jet events at Run I of 


the TEVATRON, by the CDF collaboration [111]. 


Fig. 5.13 Typical colour connections (dashed) in various parton—parton 
scattering processes: gq — qq and qq —> qq scattering (top row), and 
qg — qg scattering (bottom row). In some cases, there are more than one, 
possibilities in the large-N. limit, but usually partons in the initial and 
final state are colour connected, thus giving rise to the angular ordering 


pattern discussed in the text. 
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be visualized as two incoming and two outgoing partons, where usually each outgoing 
parton is colour-connected to one of the incoming partons. This gives rise to a maximal 
angle Omax for the emission of the third parton, which thereby effectively is constrained 
inside cones of radius max around potential emitters — the incident or outgoing 
partons. Identifying partons at leading order with jets explains the features in jet 
production seen in the experiment. Ultimately, these findings lead to the first choice 
of evolution parameter, invariant mass, to be supplemented by an explicit angular veto 
in an improved version of the PYTHIA event generator [214, 215]. 


5.3.2 Realization: Parton shower algorithms 
5.3.2.1 General thoughts 


In this section, a number of different parton shower implementations will be briefly 
introduced. In general, the specific realizations of the more general ideas outlined in 
the previous sections will all have the same parametric accuracy, typically on the 
level of leading logarithms. There are a number of subtleties in each of the actual 
implementations, which may further enhance their accuracy or which may lead to 
quantitatively different behaviour. 

In order to stress the general nature of the discussed parton showers, from now on 
the splitting kernels are denoted as K,;.4(®1), where typically the one-particle emission 
phase space ®, is written as a function of the ordering parameter t, the energy or light- 
cone momentum splitting parameter z and the azimuth angle ¢. There are a number 
of requirements on the kernels and the kinematic parameters and scales which must 
be fulfilled in order to make contact with, e.g., the DGLAP evolution equation or 
analytic resummation. There are also a number of choices that ultimately define the 
actual implementation: 


1. The form of the splitting kernels for the splitting of a parton (ij) into two partons 
i and j under the presence of a spectator k, Kij;k(®1). These splitting kernels 
are subject to the requirement that they exhibit the same universal singularity 
structure as real-emission matrix elements at leading colour. In particular, they 
must reproduce the collinear and soft-collinear limits. This implies that in the 
collinear limit they are constrained to reduce to the usual DGLAP kernels P(;;);. 
The soft limit in on the other hand is more tricky. This is because, in contrast to 
the collinear limit, it also receives interfering contributions at sub-leading colour 
which cannot be reproduced by using simple single-emitter algorithms. At leading 
colour, however, the soft limits must be correctly reproduced. 

Phrasing these considerations in a more formal way, the infrared limits are given 
by pip; —> 0, where the soft limit in addition is characterized by E; — 0 or, 
equivalently, z > 1. Then the kernels should satisfy 
1 F 
— Puji (2(®1)) for z1 (collinear) 
Kijn(®1) — ¢ PP (5.247) 


1 
pees for z—1 (soft). 
PiPj PjPk 
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It should be noted here that the form of the kernels in the soft limit given here 
may possibly not work for dipole showers, since it potentially leads to a double- 
counting of the soft region due to the symmetry of the eikonal. This problem can 
be remedied by replacing the eikonal with a similar form which will reproduce the 
eikonal when emissions of both parts of the dipole are added: 


z>] 1 — Pipk 


Kijk (® . 
jr(®1) Pip; (Di + De)D; 


(5.248) 


For more details, and a connection to, e.g., the Catani-Seymour subtraction 
method, cf. Section 3.3.3; for a discussion in the framework of constructing algo- 
rithms for parton showering, see for instance [807]. 


. The precise choice of evolution and splitting variable and the choice of scale in 
the strong coupling. Together with the exact form of the splitting kernels this 
typically influences the formal accuracy of the parton shower. At this point it is 
worthwhile to stress, again, that in principle different evolution parameters will 
often lead to identical leading logarithmic (LL) accuracy, as can be seen from 


dkij; — dm a6}, 


Se et a, (5.249) 


where k, is the relative transverse momentum of particle i and j after the 
splitting, mij is their invariant mass, and ĝ;j their opening angle. This iden- 
tity becomes apparent by realizing that in the limit of small emission angles, 
ki = zy — zij) 0y Egy and mj, = k3 /(zj(1— j)), where Euj) is the energy 
of parton (ij) before the splitting, and z;; is the light-cone momentum fraction i 
retains. 

Following this line of thought, differences in the formal accuracy of the parton 
shower manifest themselves in the sub-leading terms, i.e. terms of next-to-leading 
logarithmic (NLL) accuracy. The logic in this reasoning is related to the consider- 
ations in the framework of Qr resummation, cf. Section 5.2.1. A formal treatment 
has been given in [356]. 


. The way the kinematics of individual splittings are constructed, once its param- 
eters have been fixed. This usually impacts the way in which the parton shower 
fills the emission phase space and therefore how it relates to matrix elements 
handling additional emissions. In general this can be quantified by a kinematical 
map, which can be characterized by four cases with splitters and spectators in 
the final (F) or initial state (I). For the sake of clarity, in the detailed discussion 
of the individual shower algorithms here and below, final-state particles will be 
denoted with 7, j, and k, while initial state particles will be labelled with a or b. 


FF : Dij + Pe — pit Dj + Pr 

IF : Paj + Pk — Pa + Pj + Pk (5.250) 
FI: Pij + Da — Pi + Dj + Pa A 

Il: Paj + Po — Pat pj + po 
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The subscripts i, j, and k label the splitting parton, the emitted parton, and the 
spectator, all after the emission. The Sudakov form factor for a thus defined emission 
between the two scales t and T reads 


T 

dt! d 

AVIT, i) = exp -JẸ fa f Sse, z, $) Kigz,a(t, z, $) 
t 


(5.251) 


T 
= exp - [a Kijk (®1) 
t 


The quantity J(t’, z, ọ) in the first line of this equation takes care of any Jacobean 
that becomes necessary, but it is suppressed henceforth and assumed to be part of the 
one-particle phase-space integral in the following line. 

For simplicity, parton luminosity factors as well as the specific coupling structure 
also including all charge factors are included in the splitting kernel. In all imple- 
mentations of Eq. (5.251) the argument of the coupling is related to the transverse 
momentum of the splitting, given by the decay kinematics ®1. 


5.3.2.2 Virtuality ordering 


One of the first full-fledged implementations of a parton shower covering all aspects 
of initial- and final-state radiation was based on an ordering of the emissions by the 
virtual mass of the splitting parton: virtuality ordering, see also [216, 533, 593, 766, 
788] 

A parton shower with this ordering paradigm has been employed in early versions 
of the PYTHIA framework [852]. It is realized as a final-state shower [214, 215, 786], 
where the spectators are typically final-state particles as well, and as an initial-state 
shower [766, 851], with the other initial-state particle as spectator. The only exception 
is the first final-state splitting of a particle that has just been emitted from an initial- 
state parton, for which special arrangements are made. In general, the algorithm is 
built on the parton shower evolution being driven by 1 — 2 splittings, (ij) > i+j. The 
spectator parton k typically is defined through the configuration of the hard process 
for the first emissions, or where it is related to the splitter (ij) by having a common 
splitter, i.e. (ijk) > (ij) +k. 


1. Splitting kernels 
In both cases, FF and II, the evolution kernels Kj; are given by the leading 
order splitting kernels of the DGLAP evolution equation for the fragmentation or 
parton distribution functions, Py;;);, cf Eq. (2.31). 


2. Evolution and splitting parameters 
The evolution and splitting parameters t and z are given by 
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2 = 
OY) =, = 8, = (itp)? and 2°") = zy = Buy 
(11) (1) ye E pee 
tD = taj = |p2,|=\(pa—p;)?| and z ap 5 
j = |Pajl = |(Pa — Dj)" 7 aj Eaj 


In both cases z is directly identified with an energy-splitting parameter. Different 
choices concerning the reference frame have been provided, with the default being 
the c.m.—frame of the hard scattering. It should be noted, though, that up to 
now the impact of kinematics has not yet been included. The limits of the energy 
splitting are given by 


1 tij 
t= 5 (: + /1= Ae | O(tij — Miz) (5.253) 
ij 


where maj) is the on-shell mass of the splitting particle (ij), Euz) its energy, fixed 
in the previous splitting, and t;j is its virtual mass that has already been fixed by 
the Sudakov form factor. This choice assumes the offsprings 7 and j are massless 
— some momentum reshuffling will have to take place once they acquire a mass 
through their further splittings. This effectively will lead to a reinterpretation of 
the energy-splitting parameter of the previous parton branchings, as sketched in 
the following discussion (for further detail see the original literature). 

The infrared cut-off scale is given by a parameter Qo such that it depends on the 
physical mass of the splitting parton, mj): 


t > tD =p mnn + 2. (5.254) 


Expressed by the parameters above, and assuming massless partons, the argument 
of the strong coupling is given as 


a(k?) with k2 = i (Eci AER) (5.255) 


(1—2aj)taj (I) 


while the scale argument in the PDFs in the backward evolution is given by the 
respective t of the splitting. 


. Construction of kinematics 

The kinematics of individual FF splittings are constructed in the following way. 
The recoil partner is selected to be either the other particle in the hard 2 > 2 
scattering, if it is the first splitting in the process, or the other particle being 
generated in the splitting, or the particle emerging from them and being the 
colour partner. These possibilities are illustrated in Fig. 5.14. Having fixed t and 
z for a splitting (ij) — i+ j with massless partons i and j, the decay kinematics 
are fixed by realizing that 
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Fig. 5.14 Recoil partners in final-state parton showers. For the splitting 
(kl) > k+l, with th > tij, (ij) took the recoil, with k being colour 
connected with (ij). In the subsequent splitting (ij) —> i+ parton k will 
be the recoil partner. Assuming k to be the next parton to split, it will be 
the turn of l to act as recoil partner in the construction of kinematics, and 
in turn one of the decay products of k will compensate for four-momentum 
imbalance in the splitting of parton l. 


t =(p +p? = 2E BO (1 — cos 6;3) 


(5.256) 
E =2jE+(1—2j)E = BO +E. 


With the offsprings becoming massive this is changed by reshuffling momenta to 


arrive at 
Pij = (1-— rij) + 54D ' (5.257) 
where 
tij + tig — tji tis ti — tj)? — Atyt; 
fij = ti, (5.258) 


In emissions off initial-state particles, the energy-splitting parameter z;; will lead 
to a rescaling of the Bjorken-x of the emitter, £t; = Znew = Told/Žij = Liz /Zij, 
where z is obtained from the argument of the Sudakov form factor, essentially the 
splitting kernel multiplied by the ratio of PDFs, 


1 
| dingy Ua) Silta ta) (5.259) 
2m  Tijfij/h(Tij» tiz) 

It will always be the other initial-state particle that will account for the recoil, 
thereby boosting and rotating the full system. The algorithm is basically such that 
the Bjorken-x parameters of both initial-state particles, splitter and spectator, fix 
the centre-of-mass system. 

To see how this works in more detail, consider a kinematical situation like the 
one depicted in Fig. 5.15, where two particles a and b collide to produce a final 
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Fig. 5.15 Recoil partners in initial state parton showers. 


state with total four-momentum squared z§ = §8.;. Particle a will eventually 
emit a parton 7 in the backward evolution, thus acquiring a negative virtual mass 
p = Ba, < 0, its absolute value readily identified as the evolution parameter, 
A = taj- The momentum of the emitted particle, pj, can be fixed by using 
local four-momentum conservation, i.e. pj = Pa — Paj- This leaves the task of 
constructing the four-momentum of the new initial-state parton, pa and the emit- 
ter after emission, Paj- The centre-of-mass squared of the emerging system, §,;, 
is given by rescaling the original s.; by the splitting parameter, 


a 555 
Sab ~ (Pa + pj)? = rae (5.260) 


which will provide one constraint on pa. In the original implementation of the 
virtuality-ordered parton shower, it was assumed that both original incoming 
partons acquire a virtual mass with tg; > to = QI > 0. The logic of the 
ordering then implies that the kinematics of the backward-splitting aj —> a + j 
is constructed first. This is achieved in the c.m.-system of the two momenta Paj 
and pp, with energies and momenta pll along the beam axis given by 


2y Sa 
(5.261) 


2 
(825 + Gas + Quy) — 4AA 
Plaj) on T * 48-5 l 


In this system pa and p; will have a transverse momentum with respect to the 
||-axis; its azimuth angle is one of the four degrees of freedom of, say, pa. It is 
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chosen isotropic, and Eq. (5.260) fixes another d.o.f.. A third one is fixed by the 
mass of pa, Qok = |p*,, given by the next showering step, where a parton k is 
being emitted. This leaves a fourth d.o.f., which is identified with the virtual mass 
of the outgoing parton j. The latter is fixed by the final-state parton shower, i.e. 
the further splitting of this parton, subject to kinematic constraints and, possibly, 
by considerations invoking quantum coherence. The kinematic constraint on the 
mass assumes a completely collinear branching, (aj) || Pa || Pj, and reads 


Gajdak — TajTak 
th = pp <r Qa — Qark) > (5.262) 
2u) 


with 


qaj = 8a, + Qas) = Qon 


be Sab 
dak = Sab T Qak) = Qon = + Qak) = Qon 


qaj = ORTO 


Taj 
Tak = dak = 4Qlar) Qo « 


Usually, in virtuality-ordered showers, there would be the additional constraint 
due to the ordering in virtual masses, demanding that t; < Q? >» = taj, the value 
of the evolution parameter, where the parton was produced. After the splitting 
has been reconstructed in this way, the full system, including the full final state 
that has emerged so far, is boosted and rotated back on the beam axis. For more 
details on this and various other implementation issues, cf. [852]. 


This algorithm has been improved to approximate some of the effects of quantum 
coherence with an a posteriori-fix. This fix consists of a veto on increasing splitting 
angles, applied after each parton emission. For example, for final-state splittings, the 
opening angle of the splitting (ij) > i + j can be estimated as 
PL 1 tij 


bi x Ei x ; (5.264) 
aE; Ej zij (l — zij) Eaj) 


Denoting the kinematic variants related to the splitting of parton 7 with subscripts ; 
then leads to the angular ordering constraint 


Oi < bij > ; > j j (5.265) 
$ ij 


cf. [214, 215]. 
It should be stressed, however, that implementations where the parton shower was 
organized through an ordering in invariant masses are not usually used any more, 
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irrespective of whether it has been supplemented by an angular veto or not. One of 
the main reasons is that such showers inherently do not respect quantum coherence, 
with a number of implications. First of all, their emission pattern tends to generate 
a bit of an excess in soft radiation. Second, it is not quite clear how such showers 
can be improved beyond the leading logarithmic accuracy. Finally, and probably most 
relevant in view of the current use of parton showers, typical algorithms that match 
parton showers with exact higher-order calculations, rely on an ordering of emissions in 
“hardness”, usually given by the transverse momentum of the splitting. In virtuality- 
ordered showers there is therefore a mismatch of the evolution, and quite frequently 
not the first but subsequent emissions off a given starting configuration are the hardest 
ones. This renders the matching of the hardest emission between parton shower and 
the fixed-order calculation a formidable task. Because of these reasons, it has become 
customary to use transverse-momentum or angular-ordered parton showers instead. 


5.3.2.3 p,-ordering 


The original idea of ordering emissions according to the transverse momentum has 
been introduced in the framework of showering algorithms based on colour dipoles or 
colour antennae, [603, 729, 731, 805], see in the following sections. It has been adopted 
as a way to construct a parton shower only some time later, in [856], in the framework 
of the PYTHIA event generator. By now, this is the default parton showering algorithm 
in the latest versions of PYTHIA 6 and in its replacement, PYTHIA 8. Similar to the 
virtuality-ordered parton shower, again there is a distinction between final-state and 
initial-state parton showers. 


1. Splitting kernels 
As in the virtuality-ordered parton showers, in both cases, FF and II, the evolu- 
tion kernels Kj;,, are given by the leading-order splitting kernels of the DGLAP 
evolution equation, cf. Eq. (2.31). In addition, FI configurations are considered, 
again using the DGLAP splitting kernels for the evolution. 

2. Evolution and splitting parameters 
For final-state parton showering, i.e. FF and FI splittings, the evolution variable 
is given by 

Piaj = 2g(1 — zij) (tij — Mij)» (5.266) 


where, as before, mj) is the physical on-shell mass of the splitting particle, and 
tij is its invariant mass which is now obtained after fixing the decay kinematics, 
Le. p% ij and z;;. In terms of momenta after the splitting, zij is defined as 


1 Tı 


where 


oe DFR 2 2 
kı = oat j (5.268) 
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tig — A(tig, mZ,mF) — mF + mF 


k3 = J 2 
3 T (5.269) 
MA 2(pi + Pj + Ph) Pil; (5.270) 
ii (pi + p; + p)? l 
1 Tı 
oa eae re E = | (5.271) 
with 
Pk = Pk (5.272) 
for FF splittings and 
t; er m?. . 
! — ok |1 2 GD ot 4 Om? 5.273 
aa | (pi + pj + pk}? J es) ( ) 
—1 
tij — m?. : 
4 (ij) 2 
x 14 2tij + 2M2; 5.274 
| (pi + pj + pr)? ( 3| ( ) 


for FI splittings. 

For initial-state showering, the situation is a bit more complicated. Assuming a 
massless parton splitting into a space-like one with a virtual mass of —Q? and a 
time-like one with virtual mass m?, the relative physical transverse momentum 
reads 


Pi = (1—z)Q? — 2Q*/8, (5.275) 


where § is the invariant mass of the system formed from the splitter and the 
spectator — the other parton in the initial state — before the backward splitting. 
The choice of this exact form of the transverse momentum would lead to an 
unwanted ambiguity, when mapping it onto the virtuality of the splitter after the 
emission took place. To overcome this, instead the evolution scale 


t= (1 = z)Q? = Pi evol (5.276) 


is chosen. Close to the bottom and charm thresholds, the evolution variable is 
changed to 
PŽ evol = (1 a 2) (tig F Miz) : (5.277) 


In terms of momenta after the branching, z is given by the Q?-ordered definition 


2P(aj)Pb 
z = ————— 5.278 
2PaPb ( ) 


More details on the construction of this parton shower are given in [856]. 
Construction of kinematics 


The kinematics in the implementation [856] of this parton shower algorithm in 
PYTHIA is constructed by mapping the quantities p, and z onto invariant masses 
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and splitting parameters. The recoil of the splitting is taken by the whole final 
state for initial-state emissions, while the colour-connected particle acts as recoil 
partner for final-state splittings. This is analogous to dipole showers. 

In particular, for FF splittings the branching is performed in the rest frame of 
splitter ij and spectator k, oriented along the positive and negative z axis. In 
the splitting process, the hitherto massless splitter ij receives a virtual mass tij, 
which induces a reduction of the energies and momenta of both splitter and spec- 
tator. For FI splittings, the construction is practically identical. For initial-state 
splittings, the kinematics is identical to the initial-initial kinematics described in 
the discussion of dipole showers below, and in [839] (up to a ¢-rotation). 
Finally, it should be noted that a veto on increasing angles (identical to the 
virtuality-ordered cascade) is usually applied to initial-state splittings. 


5.3.2.4 Angular ordering 


As discussed in Section 5.1.1, angular ordering is a quantum effect stemming from the 
interference of radiation from different emitters. This idea has first been successfully 
employed for the organization of a parton shower in [750, 886], forming the backbone 
of the widely used HERWIG event generator [415]. In various publications, including 
for example [356], it has been shown that angular-ordered parton showers are accurate 
up to next-to-leading logarithms, and that the first contributions that are missed are 
colour-suppressed by at least a factor of 1/N2. 

In its original version, indeed, the opening angles of emissions, scaled by the energies 
of the emitting particles, have been employed as ordering parameters. In an improved 
version presented in [579], a modified angular variable is used instead, which allows the 
inclusion of mass effects in the parton shower in a more straightforward way. It thereby 
eliminates an artefact that is known as the “dead-cone” effect [479, 751].’ Therefore the 
following discussion is based on the angular-ordered shower algorithm in its improved 
version, implemented in the HERWIG ++ event generator [188]. It should be noted that 
in this specific implementation, all outgoing partons have a minimal outgoing mass of 
the order of 1 GeV or below. This allows a very smooth interface with the subsequent 
cluster hadronization model in HERWIG ++. It also enables the parton shower to cover 
the transverse momentum range down to 0 GeV, while in other shower models the end 
of the parton shower is defined by a cut-off in transverse momentum. 


1. Splitting kernels 
The splitting kernels for emissions off both initial- and final-state particles are 
given by DGLAP kernels involving masses, in particular 


This artefact is based on the observation that the masses of the emitting or emitted particles 
shield the collinear divergence in the emission pattern, due to simple four-momentum conservation. 
In the original angular-ordered showers the corresponding suppression of radiation was approximated 
by a hard cut on the emission angle in q > qg splitting, given by 0 > m/E, with m the quark mass 
and F its energy. This cut is too hard, and to obtain a better description, massive splitting kernels 
like the ones below have to be employed. They still exhibit a substantial depletion of radiation in the 
low-angle region, but the transition is smooth. 
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3 1+2? 2z(1— 2 
Paq(2, D1) =cr| ui zl om 


1-z pt +(1—2z)?m? (5.279) 


2 
> 2) -Tr |1- 2z(1 Pi 
Pag(Z, Pi) R z( Ee , 


with the g > gg splitting of course still given by the form in Eq. (2.31). 


2. Evolution and splitting parameters 
The evolution parameter is given by a generalized flavour-dependent angle, opti- 
mized for mapping out the singular structures in the respective splitting process. 
For final-state emissions, 


peek My n 
z2 (1-2) z(l-=-2z) 2(1-z) z(l-z} 
mi z)2 2 | a T Ed iad 
= pui ERT (5.280) 
L Ph forg > gg, 


while the argument in the running coupling is given by 
be = Sd st. (5.281) 


The actual angular ordering condition in terms of scales t (and ¢) for subsequent 
emissions 7 and 7+ 1 is given by 


tiga < z ti and tint < (1—2) ti, (5.282) 


where the splitting factors enter because of the rescaling of the momenta by them. 
In all cases above, Qg min is the minimal virtual mass for gluons and light quarks 
at the end of the parton shower, and 


u = min{m, Qy,min} (5.283) 


with m the mass of the light or heavy quark. The splitting parameter z is the 
ratio of light-cone momentum fractions along the direction of the splitting parton 
before and after the splitting took place, and k 1 is the transverse momentum 
in the splitting with respect to this axis. In its original version [579], the two 
light-cones were fixed by the two hardest partons, for instance the quark and the 
anti-quark directions in e~ e+ — qq. In an improvement this was replaced by using 
the splitter and spectator axes of motion to fix the light-cone momenta. 
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In initial-state branchings, all partons are assumed to be massless (or light) and 
the evolution parameter is given by 


12 2 
= ky T BQ yaa 


t= 284 
with the scale in the strong coupling given by 
u = (27s (5.285) 


The splitting parameter z again is related to the splitting in light-cone momentum. 


3. Construction of kinematics 

The construction of the kinematics in angular-ordered showers is somewhat tricky 
to describe, compared to the other showering mechanisms. This is because the 
actual four-momenta of the particles are fixed only at the end of the parton shower 
evolution, and the kinematics of the intermediate partons is recovered recursively. 
In the final-state case, this recursion is based on writing the momentum of particle 

i as 
qf = aip" + Bin” + di, (5.286) 
where p and n are the two axes of the splitter and spectator directions, with 
p? = m?, the mass of the particle, n? = 0, andn-qii = p-qii = 0. The 
splitting parameters z are given by the ratios of subsequent a’s, zi = Qi/Qi—1 

and 
Pia = Cy — Zil i-1. (5.287) 


With this in mind, the virtuality of parton i— 1 is given by its successors 7 through 


(5.288) 


which is successively applied until the full final-state kinematics is recovered. 
Similar reasoning also allows the reconstruction of the kinematics of the initial 
state shower. The reader is referred to the original literature for a more detailed 
description. 


One feature of angular-ordered parton showers is that they do not usually fill the full 
phase space available for emission. These gaps in the radiation pattern emerge in the 
hard, wide-angle regime and must be filled. In all practical implementations this is 
achieved by supplementing the first emission with hard matrix-element correc- 
tions filling these gaps. This “dead region” is shown in Fig. 5.18, Section 5.4.2, when 
the generic method of matrix-element corrections is discussed. In addition, it is worth- 
while to stress that the parton shower fills the available phase space in such a way 
that typically the first emissions are the large-angle soft ones at relatively low trans- 
verse momentum, while harder emissions are usually appearing later in the process. 
This is in contrast to the other parton shower algorithms based on an ordering of the 
emissions by their transverse momentum. 
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5.3.2.5 Dipole showers 


In a more recent development, parton showers have emerged that are based on the 
dipoles forming the Catani-Seymour subtraction kernels. This construction has for 
the first time been suggested in [778], and has been implemented in [466, 807, 839]. 
The underlying idea is to retain the notion of a well-defined splitting parton but to 
also explicitly identify the spectator parton in the splitting, whose kinematics in turn 
impacts on the splitting kernels. 


1. Splitting kernels 
This dipole shower is further specified by employing (Catani-Seymour) dipole 
splitting kernels. For the case of massless partons, the splitting kernels for the FF 
case (final-state splitter and final-state spectator) read 


(FF) = 2 1 Í š 
Kagk Cr f — z;(1 — Yij;k) (3 2) 
1 1 
gg,k $ 1— 2(1— Yijk) 1 (1 zi)(1 + Yijik) i 
(5.289) 


cf. also Eq. (3.168). They depend on a recoil parameter y;;,, and a splitting 
parameter zi, 
PiPj 
PiPj + PiPk + PjPk 
PiPk 
Zi = 
PiPk + PjPk 


Vijsk = 
(5.290) 


= i Zj. 


The FI, IF, and II expressions emerge from the FF case by replacing the final-state 
splitter momentum p; or spectator momentum pp with corresponding initial-state 
ones: Pik 4 —Pa,p- In addition, for initial-state splitters, recoil and splitting 
parameter change their roles, cf. also Table 5.2. Splitting kernels for them can 
be found in Appendix C.2; a modification for the FI and IF kernels reproducing 
exact fixed-order matrix elements has been proposed in [330]. This modification 
essentially consists of adding some non-singular terms which vanish in the soft 
and collinear limits and therefore do not change the logarithmic accuracy of the 
shower. 

It should be mentioned here that the splitting kernels above come with a factor of 
1/(pip;), a suitably normalized strong coupling factor and a normalization taking 
into account the number of spectators, 1/Nspec. The choices of relevant kinematic 
quantities of course also depend on the specific case. However, in the original 
publications they are identical to the corresponding expressions used in Catani- 
Seymour subtraction. For a massless shower, they are listed in Appendix C.2. 

2. Evolution and splitting parameters 
The formalism in principle allows different evolution parameters to be employed, 
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Table 5.2 Recoil and splitting parameters for various dipoles in a 
Catani-Seymour shower. The FI, IF, and II expressions emerge from the 
FF case by replacing the final-state splitter momentum p; or spectator mo- 
mentum px with corresponding initial-state ones: pi, + —pa,p. In addi- 
tion, for initial-state splitters, recoil and splitting parameters change roles. 


case || recoil parameter splitting parameter 
PiPj PiPk 
FF ijik = Zi = =I Zj 
A PiPj + PiPk + Pjpk (pi + Pj)Pr i 
FI Zija = PiPa + Pj Pa — PiPj gas PiPa = zj 
(pi + pj)Pa (pi + Pj)Pa 
PiPa PiPa + PkPa — PiPk 
IF ui = = 1S ük: | ika = 
(pi + Dk) Pa (pi + Pk)Pa 
PaPb — PiPa — PiPb 
II = Ti ab = 
PaPb 


including the invariant mass s;; of the pair after the splitting or its transverse 
momentum. This latter choice was the default in the original publications, i.e. 
t = k*, given by 


Pooh = 


2p;p;z;(1 — zi) for final-state emissions 
2 pipjz (l — zi) (5.291) 


2Papj(l — £a) for initial-state emissions 


for massless partons and more complicated expressions for massive ones. 
Irrespective of the evolution parameter t, the transverse momentum squared k? is 
used as the renormalization scale for the strong coupling and as the factorization 
scale, at which PDFs are evaluated in initial-state splitting processes. Various 
refinements to the definition of the transverse momentum variable with respect 
to the original proposal in [778] and its first implementations [466, 839] have been 
suggested, the most recent one in [626, 628]. There, a variation of the original 
evolution parameter t = k? in Eq. (5.291), making it flavour-specific, has been 
introduced, in order to better capture the singular behaviour of the splitting 
kernel. In particular, for massless final-state splittings (ij) + k—i+j+k the 
refined evolution parameter t reads 


T (l—2) ifi#gandj =g (5.292) 
= 2pipj + ; 
i Zi ifi = gandj Æ g 


while for initial-state splittings it is given by 
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1 — Za;k ifj = 
t = 2pap; - oiik a i (5.293) 
1 ifj Ag. 
3. Construction of kinematics 
In the original implementations, for the construction of the kinematics a recoil 
scheme has been used, which essentially is the inverse of the Catani-Seymour 
mappings. For the example of massless FF splittings, the kinematics is therefore 


constructed according to the inverse of the mapping in Eq. (3.160): 
Ppi = 2iPig + (1-21) Yije Be + ka 
pj = (l- zi) Pij + Zi Vijsk Pk — ki (5.294) 
Pk = (1 — Yijik) r- 

The other cases are detailed in Appendix C.2. Of course, k? = -k2 with the 


transverse momentum k] sitting in the plane orthogonal to both p;; and pk. 
There is, however, one caveat concerning the way kinematics is constructed. In the 
FF case, the Catani-Seymour simply had been inverted. For initial-state show- 
ering the situation is not quite as obvious. Merely employing the inverse of the 
Catani—Seymour kinematics, as in the FF case detailed above, would lead to a vi- 
olation of the NLL accuracy. Applying the logic inherent to the Catani-Seymour 
language, consider the case where, say, an incident quark—anti-quark pair produc- 
ing a gauge boson emits a gluon. In the first gluon emission, there is no other 
final-state particle, and therefore the gauge boson must compensate for the trans- 
verse momentum by recoiling against this gluon and thereby experiencing a k1- 
” kick”. This first gluon would decouple the quark—anti-quark pair in colour space. 
Thus, for the next emission off one of the incoming quarks, the gluon emitted first 
would act as the spectator in this IF splitting and naively take all the recoil and, 
in particular, compensate for the transverse momentum. This is to be contrasted 
with the master formula for Qr resummation in the hadro-production of colour- 
singlets, cf. Eq. (2.166) and Eq. (5.59). There, the overall transverse momentum of 
the singlet results through the coherent sum of all gluon emissions in the Sudakov 
form factor, plus, eventually, some hard corrections. It is therefore mandatory to 
subject the complete final state — and not only the spectator — to recoils in 
transverse momentum in initial-state splittings with final-state spectators. This 
has very explicitly been worked out in [807], where in addition the issue of the 
potential double-counting of the soft region when using naive splitting kernels 
has been addressed. As a result, the authors explicitly showed that parton show- 
ers based on Catani-Seymour splitting kernels with transverse momentum as the 
evolution parameter and a good recoil strategy exhibit the correct logarithmic 
behaviour. As a consequence, by now, the kinematics of emissions off initial state 
particles with a final-state spectator is constructed in such a way that the full final 
state receives a transverse momentum “kick”. Further details are given in [807]. 
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5.3.2.6 Antenna showers 


Finally, there are antenna showers (sometimes also called “dipole showers” ), which 
in contrast to the dipole showers discussed above do not make any distinction between 
splitter and spectator parton but rather consider all secondary emissions as coherent 
radiation of an antenna consisting of two, typically colour-connected partons [602, 
603]. This physical picture was first realized for final-state radiation in the ARIADNE 
program [729]. The absence of defined splitter and spectator partons translates into a 
notion of splitting kinematics as 


Dit De > pit pj + pk (5.295) 


for final state splittings. This leads to light-cone fractions x; (with l € {i, j, k}) of the 
outgoing partons given by 
_ 2p1Q 


Tı = Q? 
As before, Q! = pi + py = p} + ph + ph is the total momentum of the antenna — 
the splitter and spectator — and Q? = sij. For massless partons the relevant phase 
space can be parameterized by a generic Lorentz-invariant quantity which reduces to 
transverse momentum in a suitable frame and a generalized rapidity of the emitted 
parton: 


(5.296) 


wR = 285 ond y = Log 2, (5.297) 
Sijk 2 Sik 

Note that with the evolution parameters k, and y as defined in Eq. (5.297), phase- 

space boundaries in kj, ky > kle, translate into limits on y, ensuring finite inte- 

gration volumes and, hence, a Sudakov form factor that can be evaluated in a fairly 

straightforward way. 


1. Splitting kernels 
Writing unnormalized differential splitting probabilities in striking similarity with 
Eq. (2.2) as 

dk? 


dP ips i jk = p” 


dọ 
On ik ijk (5.298) 


allows corresponding splitting kernels to be defined. In antenna showers this is typ- 
ically achieved by deducing them from suitably chosen matrix elements through 


dọ IM xn" 


dP; on Wye , 


Thigh = dki dy (5.299) 


thereby extracting the 1/k? singularity from the real-emission matrix element.® 


8Ultimately, such an approach means that instead of exponentiating the singular terms only in 
the Sudakov form factor, full matrix elements are used and exponentiated in a way similar to what 
will be introduced as a matrix-element correction in Section 5.4.2. It is not hard to imagine that this 
is the source of the success ARIADNE enjoyed in describing QCD data from LEP. 
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In the case of FF splittings, and for gluon emission off a quark—anti-quark an- 
tenna, typically the matrix elements for y* — qq(g) are used. Using that for any 
combination {l, m, n} of the three massless final-state vectors 


Sim. = 201m = O71 — zn) (5.300) 
this leads to 


dọ Cras T2 + 02 
12m Qn (1—24)(1—2Q) 
_ dsg dSqg dp Cras (Q? — sgg)? + ("Sa 


dP; 


ik=>ijk T 


= 5.301 
Q? Q? m 2r SqgSqg ( ) 

_ dkî a dọ 2Cras (1— az, e¥)? + (1-— xie”)? 

~ k? Y Qn 2m 2 i 


Here, in the last line, 


Sag.ag = yk QR = QPae*¥ (5.302) 


has been employed. With similar considerations, and using slightly different pro- 
cesses, splitting kernels for other splittings such as gg —> ggg, qg — qqg'q' etc. have 
been obtained for the FF case, for instance in [805]. 

For splittings involving initial-state particles, two approaches have been pursued. 
First of all, in the original implementation of an antenna shower in ARIADNE, 
initial-state radiation has been re-interpreted as final-state radiation. This is 
achieved by replacing the initial-state partons of a given antenna by the corre- 
sponding hadron remnants in the final state, with the typically enhanced phase- 
space due to their larger momenta being compensated by tunable phase-space 
cuts. The origin of this idea can be traced back to [164], where emissions off 
an extended colour source, such as a hadron remnant in a collision induced by 
incoming hadrons, have been discussed. 

An alternative approach, in parallel to the treatment in the other parton shower 
algorithms, has been worked out in [895] and, in the framework of the VINCIA 
code [577], in [826]. In this approach, emissions from the IF and II antenna are 
treated in a standard perturbative language with suitably defined splitting kernels 
obtained in a way very similar to the FF case. 


2. Evolution and splitting parameters 
The customary evolution parameter is the transverse momentum given in the form 
of Eq. (5.297) or analogous for antennae with initial-state particles. Instead of an 
explicit energy splitting parameter, usually, the rapidity of the emitted parton in 
the c.m.- or Breit-frame of the emitting antennae is used, which of course can be 
translated into light-cone splitting parameters. 


3. Construction of kinematics 
In the FF case, the construction of the splitting kinematics is best understood in 
the rest frame of the antenna. In this frame the parameters x;, £j, and x, yield the 
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energy fractions of the outgoing partons with respect to the full antenna. Orienting 
the original partons along the z axis and assuming them to be massless, 


Dik = Q ; (5.303) 
2 

The first way, usually implemented, for instance in ARIADNE, is to have one of 
the partons i and k keep its direction (given by 7 or k) in the rest-frame of the 
antenna and just rescale its momentum by the corresponding x. With the absolute 
value of the transverse momentum fixed by k1, only the azimuthal angle ¢ with 
respect to the original axis must be selected. The parton which keeps its direction 
typically is chosen in the following way. In the case of gluon emissions by a qq or 
a gg antenna, the less energetic of the two partons, i.e. the one with the smaller x 
takes the transverse momentum, while the more energetic one, the one with the 
larger x, just has its momentum compressed by its x. If the gluon is radiated off a 
qg or a qg antenna, it is always the quark that retains its direction. Finally, in the 
case of a gluon splitting into quarks it is always the other parton that keeps its 
direction. Alternatively, phase-space mappings used in antenna subtraction for 
NLO calculations [430, 561-563] could be employed. Emissions from IF and II 
antennae are treated in very different ways in different realizations of the antenna 
shower paradigm, although the same reasoning as for the dipole showers applies. 
Care must be taken to ensure that these splittings continue to transfer transverse 
momentum to the full final state in order to ensure the logarithmic accuracy in 
the description of, say, the transverse momentum of Drell-Yan pairs or similar. 


5.3.2.7 Accuracy of shower algorithms 


In general, it is a non-trivial task to work out the formal accuracy of parton showers, 
and for a variety of reasons. 

First of all, it is important to realize that the formal accuracy of a parton shower is 
related to the logarithmic order that it controls. Formally speaking, it is synonymous 
with the question of which towers of a” log””~™ are correctly described by the parton 
shower. But this, of course, is only a very simplistic way of looking at it: it is of course 
also relevant which logarithms, i.e., which arguments, are thus treated. With the parton 
showers providing completely exclusive partonic final states it is obvious that not all 
logarithmic observables can be described at the same formal accuracy level. One could 
therefore easily imagine situations where one class of fairly inclusive logarithms, like, 
for example, logarithms in the transverse momentum of the lepton-pair in Drell-Yan 
production or logarithms in the thrust observable in electron—positron annihilations 
to hadrons are described at a fairly large accuracy (i.e. relatively large m). At the 
same time other, more exclusive observables, such as the correlation of the two leading 
jets in Drell-Yan processes or the thrust minor distribution in e~e+ — hadrons are 
not well described at all. When discussing the formal logarithmic accuracy of a given 
parton shower algorithm it is therefore mandatory to specify not only the level N” LL 
but also the arguments of the logarithms. 

At this point, another comment is in order. It is true that using k? as an evolution 
parameter in splitting processes where a potentially soft gluon is emitted correctly 
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captures NLL effects in the parton shower evolution. The issue of formal accuracy is 
not as clean-cut for the case of a gluon splitting into a quark—anti-quark pair. This is 
because, formally speaking, this kind of process first appears at NLL accuracy such 
that the freedom in the evolution parameter may yield a sub-leading effect related to 
that choice. The same reasoning also applies to the choice of renormalization scale. 
The effect of different choices on the invariant mass distribution of secondary quark 
pairs is quite large, and especially so for the gluon splitting into heavy quarks, where 
the differences of relative transverse momentum of the two quarks and their invariant 
mass can be quite large. 

Furthermore, there are a number of other obstacles when trying to analyse the 
formal accuracy in the radiation pattern produced by a given parton shower. There 
is a subtle way recoil schemes — the way the splitting kinematics is constructed for 
each branching — impacts on observables and, possibly, on the logarithmic accuracy. 
The most obvious example has already been discussed. For dipole or antenna showers, 
systems made from an initial-state splitter with a final-state spectator naively may 
decouple their kinematics from the rest of the parton ensemble and, in particular, from 
the other final-state particles. As a consequence, in this case the transverse momentum 
of the individual splitting is no longer transferred to the other final-state particles, and, 
consequently, the resummation of logarithms in transverse momentum breaks down. 

Correspondingly, it must be stressed that parton showers automatically implement 
detailed four-momentum conservation in each emission of an additional parton. In 
contrast, by far and large, this is not the case in analytic resummation. In most cases 
the effects of momentum conservation induce non-logarithmic corrections (“power— 
corrections” ) of the form k1 /Q with Q a typical scale related to the splitter-spectator 
pair. It is not entirely clear if (and how) such effects produce additional logarithms 
when convoluted over many emissions. 


5.3.3 Emissions off a Born-level configuration 
5.3.3.1 Hardest/first emission 


To understand the dynamics provided by the parton showers in more detail, and to 
develop the formalism further, consider the differential cross-section for the emission 
of the first — typically the hardest — parton off a core process, modelled at the Born 
level. Note that, due to its probabilistic nature, the parton shower does not change 
the Born-level cross-section for a state with N external particles given by Eq. (3.1). 
However, the process at hand and the specifics of its parton configuration, given by 
their flavours and momenta, will influence the parton shower by providing it with the 
scale uQ, the upper limit for further parton emissions. Thereby, this scale ug also 
defines the hard scale for the logarithms that are resummed by the parton shower 
evolution. 
The radiation pattern up to the first emission is given by the parton shower as 
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HQ 
doÇ™ = d&g By(dg) 4 AY (ub, te) + ddi [Ky (1) AP (nd, tE) 
te 
(5.304) 
Here, combined splitting kernels for emissions off an N-body state, Kyn (®1), are in- 
troduced and read 
Kn(®1) = 5 Kiji (®1). (5.305) 
{ij;k} EN 
In this equation, of course, the sum over {ij;k} covers all viable combinations of 
emitting, emitted, and spectator partons. Their very nature as an exponential allows 
the introduction of compound Sudakov form factors, 


AY Ose AT (5.306) 
{ij;k} EN 


Furthermore, t(®1) is the parton shower scale associated with the emission phase space 
given by ®. 

The first term in the curly bracket above gives the no-emission probability down 
to the parton shower cut-off scale te, while the second term takes into account the first 
(hardest) emission at t with the no-emission probability at higher scales again encoded 
in the Sudakov form factor. The sum of the two terms integrates to unity reflecting the 
Born configuration to either emit a parton or not. This simple probabilistic reasoning 
typically is dubbed the “unitarity” of the parton shower. As a consequence, the cross- 
section thus simulated is identical with the Born level one, while the pattern of the 
first emission is determined by the parton shower. 

Looking at the second term, the emission term, however, it is clear that further 
emissions at lower scales should also be included. In fact, they emerge by iterating the 
curly bracket in an appropriate fashion, leading to an expression of the form 


HQ 
do PS) = dög By (Oz) AN? (ub, te) + f da Key (1) AN (Wa, (1) 
te 


t 


x 2A (t, te) + J dad, Kna (®) AW, El), E) 


te 


xg AVO Ee) +... 


(5.307) 


This is the explicit manifestation of the idea that in the soft and collinear limits 
multiple emissions off a given parton configuration can be constructed recursively, 
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taking advantage of the factorization of matrix elements and the corresponding phase 
space, 


dOy41Bys1(®v 41) = d®yBy(®y) Kn dd O (uZ (PN) — t(®1)), (5.308) 


which of course leads to 


=E 


dÖËN4mBN+m(ËN+m) = dnBn (Ën) [Knri daf” o(i-» — )) . (5.309) 


i=l 


Here, tp = LO is defined by the N-particle Born configuration. This equation in 


fact represents the expression for the leading order cross-section for (n + m) particles 
extracted from Eq. (5.307), i.e., where the higher-order contributions encoded in the 
Sudakov form factors of Eq. (5.307) have been ignored. 

To allow a more compact notation, a parton shower all-emission operator Ef) (Ma, te) 
is introduced defined recursively through 


HQ 
K 
EL (ud, te) = AKO (ub, te) + / a, [Kn (G1) AM) (ud, (1) @ ELE), te)] 
te 
(5.310) 
which, due to its recursive nature, also takes care of all further emissions. Applied to 
Eq. (5.307), this yields the compact expression 


do” = d&g By (Og) EP (u2, te) (5.311) 


for the Born-level cross-section and parton configuration, dressed with all possible 
emissions. 


5.4 Matching parton showers and fixed-order calculations 


In this section, techniques will be reviewed, which allow the combination of fixed-order 
results for exactly calculated matrix elements with parton showering algorithms. 


5.4.1 Motivation 


Results from fixed-order matrix elements and resummation incorporated in the parton 
shower provide good descriptions of essential event characteristics in complementary 
regions of phase space. The same also holds true for analytic resummation techniques, 
like the Qr resummation scheme discussed in Section 5.2. There this complementar- 
ity results in supplementing the resummation part in Wij; Eq. (5.60), with a hard 
remainder part Y;;. It encodes the difference between the logarithmically enhanced 
contributions from the Sudakov form factor and the full fixed-order extra emission 
part of the real correction. In addition, virtual corrections can be encoded by adding 
the loop correction; in Qr resummation this is the term Ha», which quite often is 
absorbed as a part of Wij. It thus allows the systematic correction of the approximate 
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resummation result, order by order, to the full result. In particular, the total cross- 
section of the process can be recovered, order-by-order, and the radiation pattern of 
hard emissions approaches the fixed-order result. 

This pattern can also be found in parton shower simulations, where the effect of 
corrections to the fixed-order results becomes most prominent in observables sensitive 
to additional emissions, real or virtual. For example, due to its probabilistic nature, 
the parton shower fails to account for any higher-order effects impacting on the total 
cross-section of the simulated process. This could be cured trivially by multiplying 
the cross-section fed into the parton shower with a suitable global K-factor. Such 
a treatment would provide a fairly satisfying solution to the problem, provided the 
patterns of additional particle radiation of the underlying fixed-order calculation and 
the parton shower were not visibly different. This, however, is not necessarily the 
case, and typically parton showers and fixed-order calculations differ substantially in 
large parts of the emission phase space. In fact, fixed-order calculations are primed 
to correctly describe the emission of additional, highly energetic particles at large 
angles and typically fail to describe the softer and more collinear emissions due to the 
occurrence of associated large logarithms which eventually may overcome the smallness 
of the perturbative parameter, the coupling constant. In contrast, the parton shower, 
constituting an expansion around the soft and collinear limits of particle radiation 
excels at describing such emissions, while it typically is incapable of taking into account 
the more complicated pattern of hard emissions. Stated in a slightly more extreme way: 
in order to capture the full effect of the quantum nature of particle emission, quantum 
field theoretical methods have to be applied — in practical terms this typically means 
that full matrix elements must be calculated. They can then be used to correct the 
classical parton shower picture in a systematic way, similar to the way this is achieved 
in analytic resummation. 

Broadly speaking, methods to combine parton showers and fixed order matrix 
elements aim at combining the best features of both: of exact order-by-order calcula- 
tions, which capture all quantum interferences and, possibly, higher-order effects due 
to virtual corrections, and of the parton shower, which, depending on its formulation, 
provides a simulation of further soft and collinear emissions to leading or next-to- 
leading order accuracy. The aim of such a combination exercise always is to maintain 
the fixed-order accuracy for the overall cross-section. At the same time, the fixed-order 
accuracy of the hardest emissions should be guaranteed, but supplemented with those 
leading logarithms that are captured by the parton shower’s Sudakov form factor. In 
addition, all further, softer or more collinear emissions must still be described at the 
intrinsic accuracy of the parton shower. 

The persistent problem in any combination procedure, however, is that both the 
matrix elements and the parton shower may allow for the emission of additional partons 
off a core process, which would lead to an unwanted double-counting if not properly 
taken into account. 

In order to appreciate fully the difference of fixed-order calculations and parton 
showers and the problem underlying any combination of the two, consider the diagram 
in Fig. 5.16, where the orders of ag and the accompanying logarithms L are depicted 
for the case of resolvable parton emissions in e~et — gq + X. One could think of 
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Fig. 5.16 Orders in a, and large logarithms for emissions in e~ e™ anni- 


hilations to hadrons. 


such resolvable partons as jets, defined by a suitable algorithm. Obviously, for every 
additional jet emission, the order of as is incremented by one, and up to two new 
logarithms could emerge. This is the pattern of orders in a, and L at leading order; 
inclusive higher-order corrections typically only add an order in the coupling without 
necessarily introducing new logarithms. Therefore, in this diagram the fixed-order 
matrix elements, being of a fixed order in as, live on one (leading order only) or more 
(higher-order corrections) vertical lines; in contrast, the parton shower, resumming 
terms of the form aL?” and possibly also terms a,L?"~!, occupies the diagonals. 
These lines, vertical and diagonal, cross. This indicates a double-counting, which could 
either be positive, as an over-counting, when contributions from both are wrongly 
added, or negative, when contributions are missed in both. 

In this and the following sections, a variety of existing methods to include higher 
fixed-order terms into the parton shower will be presented, starting with a discus- 
sion of matrix-element corrections (MEC) [215, 416, 417, 786, 843] in this sec- 
tion. This method effectively allows the inclusion of the full O (as) kinematics into 
the parton shower, but without including the effect of the O (as) on the total rate. 
This will be achieved in the next section by introducing existing NLO matrix-element 
parton-shower matching algorithms (NLOPS) [543, 546, 630, 782]. An alterna- 
tive approach, aiming at combining multiple fixed-order calculations into one inclu- 
sive sample is by now known as multijet merging methods at leading order 
(MEPs) [351, 695, 730, 744] and at next-to-leading order (MEPS@NLO) [535, 
558, 631, 737]. Finally, an outlook is given to recently devised methods to even in- 
clude NNLO matrix elements for simple processes into the parton shower simulation 
(NNLOPsS) [610, 633, 634, 652]. In most cases technical details will be ignored in 
favour of clarity of the presentation. Readers interested in more technical and im- 
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plementation details and proofs of the respective accuracies are referred to the vast 
literature on this subject. 


5.4.2 Matrix-element corrections 
5.4.2.1 General idea 


The first approach to improve the radiation pattern of the parton shower in the region 
of hard emissions concentrated on the first (hardest) emission [215, 416, 417, 786, 843], 
correcting it by the exact matrix element. This technique uses the fact that in many 
processes the differential emission cross-section for one additional parton as given by 
the parton shower exceeds its exact counterpart, given by the matrix element, in the 
full emission phase space. Examples for such a behaviour, where the technique sketched 
below has traditionally been implemented, include the processes e~ e+ — qqg,t > bWg 
or the emission of an additional parton in the production of vector bosons at hadron 
colliders. In other words, 


Rn (Sz x ®,) < Bn(®z) Q Kn(®1), (5.312) 


where Ry denotes the real correction to the process with N external particles, i.e. a 
matrix element for (N + 1) external particles at Born level. The trick of the method 
is to assume a modified splitting kernel Ky, given by 


Ky(®1) = Rn (®g x ®1)/By (zp) (5.313) 


and to use this kernel for the description of the first emission in the parton shower. 
It should be stressed, though, that rather than evaluating the real-emission term Ry 
at a fixed scale, it is customary to use the same kinematics-dependent scale as in the 
parton shower. 

This implies that the equation for the Born cross-section including the first emission 
through the parton shower of Eq. (5.304) in the section above becomes 


doh") = dds By(bs) 4 AP ut) + f ads [RH AUR, E) 


te 


2 


= d&g By(®z) AG! (ub, te) f do, | 


te 


Rn (®e x $1) | (R/B); 2 
= H AN (Ha; H(@,)) 


(5.314) 


Again, the terms in the curly brackets integrate to unity, indicating that the simulated 
cross-section is identical to the Born level one. This time, however, the radiation 
pattern is determined by the full real radiation matrix element as encoded in Ry 
rather than by the parton shower. This can trivially be seen by expanding the emission 
term in Eq. (5.314) up to first order in the coupling, keeping in mind that Ry has one 
more power in a, than the corresponding Born term By, and that effects of running 
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Qs are of higher order as well. All further emissions are still driven by the original 
parton shower kernels. 

In practical implementations, usually the original splitting kernels and parton 
shower evolution parameters are used, and the parton shower is corrected to the mod- 
ified splitting kernel through simple reweighting. Algorithmically, this means that first 
test emissions are generated by the original parton shower and accepted with a prob- 
ability given by 

Ky (1) Rn(®z x 21) 


Pmec = Kela) Bre) x Kwa)’ (5.315) 


5.4.2.2 Example: First emission in eet > qq 


Using e~e* — q@g to further illustrate the idea, the first step is to compare the 
differential cross-sections for this process in the matrix element and parton shower 
approach. The matrix element given by the Feynman diagrams of Fig. 5.17 yields [498] 


Cras z? + x 
2m (1—421)(1— z3) 


AD, Raal(Pag X Pg) = Bag(Pqq) X dx daz (5.316) 


where 
£13 = 2E4,3/Ec.m. € (0, 1] (5.317) 


are the energy fractions the massless quark and anti-quark carry after gluon emission. 
The gluon emission phase space, after performing the azimuthal integration, is given 
by d®, x da, dzz. 


Pı Pı 
P2 P2 
P3 P3 


Fig. 5.17 Feynman diagrams for the emission of an additional gluon in 
quark-anti-quark pair production in lepton annihilations. 


The parton shower expression has to be obtained from the details of the map 
relating its variables to the splitting kinematics. For the case of a virtuality-ordered 
parton shower with z defining the energy components in the splitting, the virtual mass 
and energy-splitting variables are given by 


tg = m3; = (pı + po)? = E? m (1 — 2s) 
A (5.318) 


Then the O (as) expression for the gluon emission by the parton shower becomes [215] 
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dt; Crag 1 +2? 
d®, Bya( Ba ac) = Byq(Paq) X 5 dzi = 


ti 2m 1-4; 


Cras dzıdz3 
Qn (1-— zxı)(1-— 23) 


JEE i i (25) ] i" i (2) |} . (5.319) 


For the angular-ordered parton shower in HERWIG, the situation is a bit more compli- 
cated due to the somewhat non-trivial phase-space limits, leading to 


Cras dzıdz3 
2m (1—2,)(1— z3) 


ddy [Baal®a) x K184)] = Bya( Bae) x 


x 


-15° 
14 (=+) | +zı z3, (5.320) 


Tı zı > 1—z(1-— z) 
23 >1l—2+ 22, 


where the splitting parameter z is given by 


1 Tı + 223 m 1 
= t 5.321 
=p 271 (a 


for massless quarks. 
If instead a parton shower based on Catani-Seymour splitting kernels was used, 
the evolution parameters are k? and z; with 


k2 = E?’ Yijik zi(1 = zi) 
PiPj 


Yij;k = = l- Tk 
v PiPj + PjPk + PkPi (5.322) 
‘(pi +pj)pk 2-2; - Xj Tk 


Using the dimensionless parameter yYij;k, with 


dk? — dyij; 
IL = ik (5.323) 
k3 Yij;k 

yields as the Catani-Seymour parton shower result for gluon emission in e~e+ > qq 


Cras dx,dx3 
d®, [B® x K) = Bgq(®qq) x Qn (1—2)(1—23) 


«{ [at +23] ! for ! Cze a) (5324 


T3 Tı 


In all cases, the acceptance weight is given by the ratio of z? +23 and the terms in the 
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Fig. 5.18 The ratio of the matrix element and the parton shower ex- 
pression for the differential emission cross section of an additional gluon 
in quark—anti-quark pair production in lepton annihilations, Eq. (5.319) 
(virtuality-ordered parton shower, upper left), Eq. (5.320) (angular or- 
dered parton shower, upper right) and Eq. (5.324) (Catani-Seymour parton 
shower, lower left). 


curly brackets. Of course, further parton shower formulations may lead to yet different 
weights, depending on the details of the map between the parton shower parameters 
and the kinematical quantities parameterizing the matrix element. 

The algorithm then is to generate an emission through the parton shower, and to 
accept it with a probability given by the ratio of the two expressions above. Profiles 
for this ratio for different parton shower implementations are depicted in Fig. 5.18. 


5.4.2.3 Limitations 


While this technology is fairly transparent and straightforward in its implementation, 
it is limited in its applicability. First of all, it depends on the parton shower expression 
(or a suitable multiple of it) to be larger than the corresponding exact matrix element 
in all emission phase space, which is not always the case. This is especially true for 
production processes at hadron colliders, where the huge phase space for emissions off 
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the initial state is not necessarily filled by the parton shower, see below. In addition, 
going to higher multiplicities the analytic parton shower expression becomes increas- 
ingly untraceable, which translates into the problem that the rejection weight cannot 
be constructed any more. 

As already mentioned, there are situations, where the parton shower does not 
entirely fill the phase space. There are two possible reasons: 


1. In cases like the radiation of extra partons in production processes at hadron col- 
liders, the emission phase space is typically constrained by an upper scale given 
by the kinematics of the hard process. For example, in the production of vector 
bosons, such a scale would be given by the (virtual) mass of the vector boson. 
Since the transverse momenta in any parton emission generated by the parton 
shower are bounded from above by the scale of the previous process, as a conse- 
quence, only transverse momenta below the mass of the vector boson would be 
generated — the parton shower would completely miss the large-p, tail of the bo- 
son kinematics. This problem naively could be cured by opening up the emission 
phase space through a harder starting scale of the parton shower and a suitable 
matrix element reweighting. This is sometimes referred to as “power shower” in 
the literature, while sticking to the boson mass is dubbed “wimpy shower” [858]. 
However, the natural question then arises which scale-logarithms would actually 
be resummed in the Sudakov form factor in each case, and it quickly becomes 
apparent that in the power shower case the link to the standard analytic resum- 
mation techniques is broken. There, the upper limit of the &,—integral in the 
Sudakov form factor is given by the boson mass, at odds with the power shower, 
where the upper limit is given by the energy scale of the hadronic collision, a 
violation of factorization theorems underlying all perturbative calculations. 

2. Secondly, it is possible that the parton shower, while perfectly in line with re- 
summation techniques, misses some regions in phase space by construction. This 
is particularly true for angular-ordered parton showers like the ones implemented 
in HERWIG [415, 750] and HERWIG++ [188, 579], see the upper-right panel of 
Fig. 5.18 for the case of gluon radiation in e~e+ — qq. In this case the soft 
matrix-element correction discussed so far needs to be supplemented with 
a hard matrix-element correction. This essentially boils down to using the 
matrix element to fill the phase-space region omitted by the parton shower. To 
guarantee a smooth transition, in this hard correction renormalization and factor- 
ization scale definitions as in the parton shower are used as well as some Sudakov 
weight for the intermediate quark line. 


5.4.3 Next-to-leading order matching — the POWHEG method 
5.4.3.1 Underlying idea 


To combine an NLO calculation with a parton shower, it is essential to ensure that the 
result inherits the total cross-section from the fixed-order (NLO) calculation and that 
the radiation pattern to first order follows the real emission part of the calculation. 
In addition, from a parton shower point of view, it is also important to maintain its 
intrinsic logarithmic accuracy, which is substantially harder to achieve and to prove. 


366 QCD to All Orders 


The fixed-order part of these requirements could trivially be achieved by multi- 
plying a parton shower simulation with a suitable global K-factor and by applying 
a matrix-element correction in the style discussed in Section 5.4.2, to the first emis- 
sion. The trick to improve this simple recipe, and the idea underpinning the POWHEG 
method, however, is to define a local K-factor in such a way that the integral over 
the phase space of the Born-level configurations yields the full result at next-to lead- 
ing order. To see how this works in more detail, remember the form of the total NLO 
cross-section 


oh) = Jass | By (28) + Vn (2B) + ZK (Ps) | 
(5.325) 
+ [ an [Rx(®r) - Sw(r)] 


from Eqs. (3.120) and (3.130). 


5.4.3.2 Local K-factors 


Using the logic of the subtraction method and the implicit factorization of the real 
emission phase space into a Born-like phase space and a one-particle emission phase 
space, it is simple to construct expressions for Born-like configurations B, which yield 
the cross-section fully accurate at next-to-leading order level. Symbolically they are 
given by 


Bn(®g) = B(x) + Pv(bx) + | a, | Ra (s @ 41) ~ Sr@s 82) , (5.326) 


where the sum over different subtraction terms is understood, and where the renor- 
malized and infrared-subtracted virtual contribution has been combined into 


Vy(®g) = Vv(g) + (Sz) (5.327) 


This construction only works out, if the full real-emission phase space ®r can be 
written in the factorized form as 


bp = 434). (5.328) 


If this is not the case, the additional phase space is guaranteed to be infrared finite 
and the correct result of Eq. (5.325) can be recovered by merely adding the difference, 
a difficulty which will be ignored in the following. In their absence Eq. (5.326) indeed 
yields the full NLO cross-section upon integration over the Born-level phase space ®g. 

The terms B therefore can be interpreted as fully differential cross-sections of Born- 
level configurations with a next-to-leading order weight, or, stated slightly differently, 
as Born-level configurations modified by a local K factor. The next-to-leading order 
accuracy of course would not be spoilt by any unitary parton shower added to it, 
so one just has to ensure that the pattern of the first emission is correct up to first 
order in a; in order to arrive at a fully NLO accurate simulation. Going back to the 
technology of matrix-element corrections introduced in Section 5.4.2 indicates how this 
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can be achieved: the full real-emission matrix element for the first emission must be 
used, this is easily achieved by essentially replacing the parton shower kernels K with 
R/B. Therefore, combining the B terms with the first-order correct radiation pattern 
of Eq. (5.314) results in a simulation which is correct to first order of the coupling 
for both the inclusive cross-section and for the emission of the hardest parton. This 
essentially is the core of the POWHEG method introduced for the first time in [543, 782] 
and in heavy use ever since. 

In this matching method, the differential rate up to the hardest emission is given 
by 


m 
Ry(® ® 
2 AG! gtt f dð: aes AN! (ug, (1) 
te 


(5.329) 


It is straightforward to prove that this yields the correct NLO cross-section, since the 
term in the second line integrates, again, to unity. In order to see that the radiation 
of the first/hardest emission follows the exact real correction matrix element at first 
order in the coupling, it suffices to note that the term Ry /By already is at first order 
in the coupling. This allows to ignore, at this accuracy level, all next-to-leading terms 
in By, i.e. all terms stemming from real or virtual corrections or the corresponding 
subtractions, since they would yield terms of order O(a?). 

One subtlety ignored so far is the choice of the “right” renormalization and factor- 
ization scales in both By and in Ry /By. By far and large it is advantageous to keep 
to the choices made in the parton shower in the emission term, i.e. Ry /By. For the 
integrated emissions in By, on the other hand, it is probably better to keep choices in 
line with the choices also made in By and especially in the virtual part Vy in order 
not to spoil the exact cancellation of infrared divergences. In all cases, such choices are 
essentially of higher order in as and therefore do not hamper the fixed-order (NLO) 
accuracy of the approach. 

Similar to the case of matrix-element corrections, however, there are a number of 
pitfalls beyond fixed-order accuracy. This is especially true for processes at hadron 
colliders; as an example consider the case of Higgs boson production in gluon fusion. 

First of all, as in the case of matrix-element corrections, it is not clear what scale to 
pick as upper scale for the parton shower evolution. Standard resummation technology 
suggests choosing a scale of the order of the Higgs boson mass my as an argument 
in the logarithms, i.e. ug ~ my. This choice however does not allow a description 
of transverse momenta of the Higgs boson in the high-p, tail, at scales above my. 
Conversely, choosing ug = my in the equation above, Eq. (5.329), will automatically 
constrain the phase space available for the hardest emissions to scales below mz. 
This is at odds with maintaining O(a,) accuracy over the full emission phase space. 
Without modifying the algorithm outlined up to now, therefore a choice must be made 
between logarithmic and fixed order accuracy of the approach in the high-p, tails of 
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additional emissions. 

Assuming that the full phase space is opened up for the hardest emission, i.e. 
UQ — Ecms of the hadronic collision, a second question naturally arises. The local K- 
factor encoded in By corresponds to the inclusive production of the n-body final state, 
and in particular after integrating out additional partons in Ry. By construction it is 
applied to all events, and in particular to those where the hardest emission is harder 
than the typical scale related to the n-particle state. This of course is questionable, 
since a priori the K-factors for n-particle and for (n + 1)-particle final states do not 
coincide. It is of course possible that such discrepancies are not so large, and that 
therefore the tails of large-p, of the produced system (in the example here the Higgs 
boson) are well described. An illustrative study of this problem is depicted in Fig. 5.19. 
However, as can be seen in the left panel of this figure, the tail of the distribution 
differs significantly from the fixed-order, i.e. NLO, result. Even more, comparing with 
another NLO matching method, MC@QNLO discussed in the next section, Section 5.4.4, 
it appears as if the latter interpolates between a low-p, regime, where a K factor is 
applied for both, which therefore agree with each other, and the region of large p_, 
where only the POWHEG simulation is modified by the K factor. In the right panel, this 
difference is unambiguously traced back to the influence of the K-factor. Replacing 
the Born-term in the denominator of the emission kernel R/B in the Sudakov form 
factor with B, Eq. (5.329) schematically becomes 


2 
HQ 
dot — ddz By < AGP (u3, te) +f dð, E aga] , (5.330) 
te 
where the by-now familiar phase-space arguments in the individual pieces have been 
omitted. In this form, the higher-order enhancement is cancelled, and the pı spectrum 
of the NLO result is recovered. Keeping, on the other hand, the original form of 
the emission kernel in Eq. (5.329), the result resembles more the NNLO result. It 
appears however that this is purely a coincidence and not related to any systematic 
improvement. 


5.4.3.3 Improving the POWHEG method 


One way of solving both potential pitfalls outlined above is to decompose the real 
emission phase space into a “soft” and a “hard” part. This is essentially achieved 
by defining soft and hard real emission matrix elements RS) and R® with support 
only in the corresponding phase-space regions. The former would contain the infrared- 
divergent parts and thus have to be suitably subtracted, while the latter would be free 
of infrared divergences. As a further result, the soft part only would be employed for 
the definition of the local K factors modifying By to obtain By and for the parton 
shower kernel for the hardest emission off such configurations, while the hard part 
would be added separately, with the parton shower defined for all emissions through 
its usual kernel. 

As an improvement of the POWHEG implementation this was first discussed in [139], 
and the same idea has been used for the POWHEG simulation of various other processes 
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Fig. 5.19 POWHEG predictions for the Higgs boson p,, p. In the top 
panel the inclusion of the parton shower on př (POWHEG +HERWIG) is 
compared with the NLO result and with the distribution where only the 
first emission is simulated (POWHEG). In the lower panel a comparison is 
made where R/B is replaced by R/B and with the NNLO result. (Figures 
taken from [139].) 
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since then. Specifically, the decomposition of phase space is achieved with a smooth 
function. For the example case of Higgs boson production in gluon fusion it reads 


h? pi ©) pt) 
= Bs 
Rn =Rn (> Fp T A L) = Ry FRN (5.331) 
with p] the transverse momentum of the Higgs boson. The new parameter h will 
typically be of the order of the relevant resummation scale of the underlying Born 
configuration, in the example here therefore h ~ my. Alternatively it could be “tuned” 
to exact higher-order calculations or calculations involving the resummation at higher 
logarithmic accuracy.? 

Omitting again the phase-space arguments in the various parts, the differential 
rate up to the first emission in this improved formalism therefore reads 


2 
HQ 
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te 


+ dör R™® 
(5.332) 
with a modified local K-factor defined through 
B= B+Vy+ / d®, R® — s| ; (5.333) 


The first line describes emissions in the soft regime, multiplied by the modified K- 
factor. For the example of Higgs boson production, it thereby experiences an enhance- 
ment. The second line describes the radiation in the hard regime, and it is not modified 
by any K-factor. 


5.4.3.4 The interface to the parton shower 


In its inception the POWHEG method was presented as an algorithm to promote ar- 
bitrary showers to higher fixed order, and in particular NLO, accuracy. There are, 
however, some caveats to this, which have to do with the fact that POWHEG works for 
ordering the emissions through their hardness, typically identified through a quantity 
like transverse momentum. However, as has already been seen in Section 5.3.2, there 
are about as many definitions of such a quantity as there are parton shower algo- 
rithms. For instance, in many cases the transverse momentum of a particular splitting 
is defined with respect to a well-specified spectator parton; but in the commonly used 
interface structures the information of a spectator parton is not provided, and the 
parton shower algorithm may make other choices than the matching code feeding 
parton-level configurations into it. In addition, for many showers, only one such hard 


9This indeed is the case for the illustrative example, Higgs boson production in gluon fusion, where 
h has been fixed to a value of h = 1.2m p by comparison with the result of NNLL resummation [442, 
596]. 
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scale is provided globally for the full parton ensemble, which may therefore have a 
meaning which in extreme cases may lead to sizable mismatches. Consequently, this 
leads to an additional source of uncertainty purely related to the matching, which 
could and possibly should be systematically assessed. 

The obvious way out to is to ensure that the definition of the hardness scale or 
transverse momentum in the fixed-order part of the simulation is identical to the 
evolution parameter in the parton shower. Alternatively, if this cannot be achieved, 
one could invoke truncated showering, already introduced in [782]. There, it was 
noticed that the parton shower implementation in HERWIG uses angles instead of 
transverse momenta as the evolution parameter. This leads to a situation where the 
first emissions in the parton shower typically are large-angle emissions of rather soft 
partons which will often result in a relatively small transverse momentum. At the same 
time, the more collinear splittings into partons with larger energies quite often appear 
towards the end of the shower evolution — implying that the hardest splitting, i.e. 
the one with the largest transverse momentum, can essentially happen at all stages 
of the evolution. This has to be accounted for in the matching, in order not to upset 
the resummation of logarithms in the parton shower. The only way to achieve this 
is to allow the parton shower to emit partons at larger angles, but lower transverse 
momenta relative to the hardest emission fixed by the POWHEG formalism, which in 
turn must be inserted at some point into the parton shower evolution. Such a strategy 
in principle could be employed whenever there is a mismatch of evolution and hardness 
scales in the parton shower and fixed-order parts of the simulation. 


5.4.4 Next-to-leading order matching — the MCQNLO method 
5.4.4.1 Underlying idea 


Historically, the first solution of how to match NLO matrix elements with the par- 
ton shower has been provided by the MC@NLO method, pioneered in [546]. While, 
somewhat loosely speaking, the POWHEG method is nothing but a matrix-element cor- 
rection method supplemented with local K-factors, the MCQ@NLO method is closer in 
spirit to analytic resummation. Similar to the way the calculation is organized there, 
the real emission correction is decomposed into a part driven by Sudakov form factors 
and realized by the parton shower, and a hard remainder. And, while the former will 
experience higher-order corrections, like in Qr resummation at NLO+NLL accuracy, 
the latter will not. In particular, the decomposition in MCQ@NLO is given by 

Ry (Gr) = RẸ (r) + RW (Sr) = Sv(Sp® G1) +H (Gp). (5.334) 
The catch in MC@NLO is to identify the subtraction terms with the shower kernels 
such that, symbolically, 


Sy (Sp ® 81) = X` By (Gp) @ Kij:¢(O1) = By(®p) 9 K(P1). (5.335) 
ijk 


The MC@NLO version for the differential rate up to the first emission thus is given by 
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+ dR Hn (®p) ; 
(5.336) 
where this time the NLO-modified Born term is 
By(®g) = By(®z) + Vy(z). (5.337) 


The virtual part of the NLO correction is applied only to those emissions that follow 
the kinematics given by the parton shower. In turn the phase space of such emissions is 
guaranteed to be in line with its leading logarithmic pattern. Since, by construction, the 
hard remainder Hy does not contribute any terms at the same logarithmic accuracy as 
the parton shower, the logarithmic accuracy of the latter is automatically maintained. 
Similar to its counterpart in Qr resummation, the hard emission term in the second 
line of Eq. (5.336) therefore has a dual role. It firstly corrects the hardest emissions from 
the parton shower such that they follow at fixed order the exact matrix element. At the 
same time it fills emissions in those regions which are inaccessible by the parton shower 
with the exact leading-order pattern. Therefore, also the MC@NLO method fulfills the 
fixed-order requirements for a successful NLO matching of matrix elements and the 
parton shower. It can trivially be seen that it also maintains the logarithmic accuracy 
of the parton shower, and potential problems with the arguments in the resummation 
related to the choice of parton shower starting scale are avoided by construction: the 
parton shower starts exactly at the same scale as it would without the matching: at all 
orders and with help of the all-emissions operator EM from Eq. (5.310) the expression 
in Eq. (5.336) can be rewritten as 


do = dbp Bn (28) EN” (ud, te) + dër H(Pr) EN (Hi, te), (5.338) 


where py is the starting scale related to the hard remainder. 


5.4.4.2 Treatment of colour 


A potential pitfall in the MC@NLO method is related to the fact that the parton 
shower is a leading colour approximation. This translates into the fact that for general 
processes it is impossible to subtract all soft divergences, since sub-leading colour con- 
figuration usually do exhibit soft singularities at next-to-leading order (but of course 
no collinear ones). 

This is merely a technical problem — the NLO accuracy remains unharmed by such 
potentially uncancelled singularities at sub-leading colour. This is because in princi- 
ple they could be cancelled after averaging over the various directions in the eikonals 
and identifying this suitably in the splitting kernels of the parton shower. This is how 
this problem indeed was cured in the original MC@NLO algorithm, when applied to 
tt-production at hadron colliders in [544, 546]. In practical terms, a dampening func- 
tion was introduced there, which essentially redistributes the soft sub-leading colour 
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divergences by modifying the leading colour subtractions. However, at fixed order, the 
hard remainder term Hy will modify this averaging and it will thereby guarantee that 
the correct resolvable radiation pattern at NLO accuracy is always recovered. 

Alternatively, it is also possible to modify the emission kernels for the first emission 
in such a way that the full colour structure is directly recovered [630]. To do this, the 
summing over colour structures inherent in the construction of the splitting kernels 
usually employed in parton showers has to be undone allowing to recover all colour 
terms. In the framework of a parton shower based on Catani—Seymour splitting ker- 
nels [466, 778, 839], this translates into replacing the colour-averaged kernels discussed 
in Section 5.3.2 with a sum over all dipole terms Dijk- Phrased in other words, the 
sum must go over all splitters ij and all spectators k irrespective of whether they are 
colour-connected or not. Thereby structures like the colour-insertion operators T;j: Tk 
will appear, made explicit for instance in Eq. (3.167) for the case of two final-state 
particles. 

Upon evaluation in the explicit case, these operators however may become nega- 
tive, leading to a negative weight for a splitting in this particular configuration. By 
construction, the sum over all T; will lead to Ti; due to colour conservation in any 
physical parton ensemble, and thus the splitting weights will be positive overall. How- 
ever, in certain directions given by certain spectators k, the admixture of positive and 
negative contributions will result in a negative contribution. At fixed order, such terms 
do not pose a problem. But in addition to the correct fixed-order result, in the imple- 
mentation in [630], these sub-leading colour structures are also part of the Sudakov 
form factor, thereby accounting to some degree also for effects beyond fixed order. The 
technical problem there is related to the fact that negative splitting weights lead to 
negative arguments in the Sudakov form factors. This apparent violation of a prob- 
abilistic interpretation manifests itself naturally in Sudakov form factors larger than 
unity. Such “anti-probabilistic” features necessitate a modification of the showering 
algorithm, cf. [630, 809] for technical details. 

While in most cases the effects of including such sub-leading terms are small, there 
are some observables for which they are surprisingly large. An example is provided by 
the case of the forward—backward asymmetry App in tt production at the TEVATRON, 
where such effects have been studied for instance in [627]. 


5.4.4.3 The interface to the parton shower 


Similar to what has been discussed already in the previous section, the K factor — 
essentially the correction related to the virtual part V — is only applied to those 
radiation configurations that are produced by the parton shower, while the other 
configurations, produced by the hard correction term, essentially come with their tree- 
level weight. This leads to an NLO modification of the radiation pattern below the 
parton shower starting scale, and an LO distribution above. In fact, this is exactly the 
behaviour one would expect, and it is more or less a matter of taste whether a smooth 
transition like in the POWHEG case or a steep one like in the MCOQNLO case takes this 
into account in a cleaner, better, or more transparent way. 

However, this behaviour leads to yet another subtlety related to the practical 
implementations of this method. As already seen in Section 5.3.2, there are parton 
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shower algorithms that may not fill the full emission phase space, like, for instance, the 
angular-ordered parton showers implemented in HERWIG [415] and HERWIG ++ [188]. 
Typically, these “holes” in the emission phase space are filled through the hard matrix 
element corrections, which, by a combination of clever scale choices in a, analogous 
to the parton shower and, eventually, some Sudakov weights will smoothly connect to 
the parton shower description in the transition between the two regimes. 

In MC@NLO, life may not be quite as simple. As the cross-section in the regime 
of soft emissions, treated by the parton shower is modified by the local K factor — 
essentially by the term V — this smooth transition may be lost and a mismatch 
in the radiation pattern may emerge. The emergence of a similar pattern can also be 
observed in the MCQNLO method as implemented in SHERPA [630] by constraining the 
real emission phase space by measures that are incompatible with the parton shower 
evolution. 


5.4.4.4 Comparison with analytic resummation methods 


At this point it is worth comparing the MCQNLO method with analytic resummation 
methods, and in particular Qr resummation. 

In the master formula for Qr resummation of heavy singlets in hadron—hadron 
collisions, Eq. (5.59), the hard remainder parts Y;;,x are not supplemented with 
any further emissions encoded in Sudakov form factors. However, the construction of 
the R-terms, exemplified for W production in Eq. (5.78), follows the same principle 
as the construction of Hy here: It results from the fixed-order real-emission cross- 
section based on the matrix element Ry through subtraction of the O (as) terms in 
the resummed part. 

In a similar way, analysing the structure of the resummed part in the MC@NLO 
equation above, it is fairly straightforward to realize that the subtracted virtual term, 
Vy includes not only the hard loop correction Ha» of Eq. (5.60) to first order in as, 
but also the two collinear terms Cia to the same order; in particular terms such as 
P: (z) are part of the underlying standard NLO subtraction procedure. This leaves the 
Sudakov form factors. In standard Qr resummation overall momentum conservation 
is guaranteed through the Fourier transform to impact-parameter space (b,-space). 
This of course is not necessary in parton showers, which have momentum conservation 
included for each individual emission. The only subtlety here is to realize that along 
the initial-state parton shower, all emissions must contribute to the overall Q of the 
singlet system. In the original proposal for the construction of a parton shower based 
on the Catani-Seymour splitting kernels and kinematics [778] this is not quite the case: 
there the recoil, i.e. the transverse momentum “kick”, of a splitting is only absorbed 
by the colour partner of the system and, in particular in the case of initial-state 
splittings, not by the full final state. This is the reason why this proposal has later 
been augmented with kinematical mappings where initial-state legs never decouple 
from the rest of the system during parton showering [626, 780, 807]. However, when 
comparing the terms in the Sudakov form factor of analytic resummation, Eq. (5.60) 
and Eqs. (5.65) — (5.67), it becomes apparent that the same terms A(?) and BO) 
— possibly up to power corrections of the type Q? /Q? involving no logarithms — 
are also present in the parton shower Sudakov form factors. This suggest that parton 
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showers that are ordered in k, or opening angles correctly resum those next-to-leading 
logarithms log(Q* /Q?) in singlet production, which are also present in the Sudakov 
form factors found in analytic resummations of the same quantity. 

In addition, a successful matching with fixed-order calculations at next-to-leading 
order also includes all terms appearing in the collinear and hard contributions — 
the formal accuracy of such samples therefore is NLO+NLL in the language of Qr 
resummation based on the Collins-Soper—Sterman approach. 


5.5 Multijet merging of parton showers and matrix elements 
5.5.1 Multijet merging at leading order 
5.5.1.1 Underlying idea 


The merging of multijet matrix elements at leading order recognizes the fact that 
for all emissions leading to a sufficiently hard new jet matrix elements provide the best 
description, while the strength of the parton shower lies in accounting for the softer or 
more collinear radiation inside such jets. The idea therefore is to decompose the emis- 
sion phase space into two complementary regimes: one hard regime of jet production, 
and a soft regime of jet evolution. In general such a division can be achieved by intro- 
ducing a jet resolution criterion Qeut, typically related to the transverse momentum of 
emissions. In the first multijet merging algorithm presented in [351], Qcus indeed was 
identified with a k,-jet measure from the Durham jet algorithm [345]. Consequently, 
emissions harder than Q.y are described by the appropriate matrix elements while 
softer emissions are left to the parton shower. 

Naively, thus, it would be preferable to push Qeut to the lowest possible values, of 
the order of the parton shower cut-off. In practice, however, this turns out to be poten- 
tially tricky. First of all, there are technical problems related to the increasingly poorer 
convergence of the multijet matrix elements with increasingly softer cuts, rendering 
event generation, especially of unweighted events, increasingly or even prohibitively 
CPU consuming. In addition, and as seen in previous chapters, the introduction of 
such a jet measure in fixed-order calculations for processes characterized by a typical 
hard scale Qnara introduces logarithms (or their squares) of the form log(Qnara/Qeut)- 
These logarithms will become larger for increasingly disparate scales Qeut and Qhara, 
and therefore they may overcome the smallness of the perturbation parameter, typi- 
cally a,/7 or similar. This signals a collapse of the validity of fixed-order perturbation 
theory and the need for resummation. While the latter is partially achieved by the par- 
ton shower, at least at leading or next-to-leading logarithmic accuracy, the residual 
sub-leading logarithmic terms may still introduce unwanted and dangerous correc- 
tions that will not be accounted for. It is therefore important that the behaviour of 
the calculation for small Qeut be carefully monitored, to thereby protect the calcula- 
tion against such unavoidable sub-leading contributions becoming numerically large. 
In practice this can be achieved by varying Qcut and evaluating the effect on critical 
distributions. 

However, directly including the parton shower Sudakov form factors, maybe in 
their analytic form, into the fixed-order calculation will lead to a stabilization of their 
behaviour and a vastly increased level of convergence for low emission scales, i.e. 


376 QCD to All Orders 


in the Sudakov region. This in fact has been worked out in more detail by the 
authors of the MINLO method in [609, 611], see Section 5.6.1, who combined the scale- 
setting prescription of multijet merging methods discussed below with a corresponding 
Sudakov reweighting, allowing them to push Qeut to values around or below O (1 GeV), 
the infrared cut-off of the parton shower. 

Going back to Fig. 5.16, the picture there as applied to multijet merging at leading 
order translates into using the vertical lines, corresponding to fixed-order expressions 
for jet production, with the number of jets increasing with the order of as and to 
combine them with the terms populating the diagonals. Immediately, the problem 
of double-counting of some terms becomes apparent; it should be stressed that this 
double-counting in general can be constructive, i.e. the same terms are attributed 
twice, or it can be destructive, by not covering them at all. In order to remedy this 
problem, a dual strategy comes into play. First of all, the matrix-element expressions 
are evaluated with suitable scale choices for the strong coupling, in such a way that 
their counterparts in the parton shower are emulated. In addition, they are weighted 
with suitable Sudakov form factors. With the interpretation of the Sudakov form 
factors as no-emission probabilities in mind, this second step transforms the matrix 
elements, which describe the inclusive production of an N-jet system plus anything else 
into matrix elements that describe the exclusive production of an N-jet system only, 
with no further resolvable emission above Qecut. At the same time, the parton shower 
is modified such that any further jet emissions are vetoed. There are various ways of 
achieving this, which impact differently on the parametric accuracy provided by the 
matrix elements and, in particular, by the parton shower. While maintaining the fixed 
order accuracy of the former is fairly straightforward, the logarithmic accuracy of the 
parton shower is harder to conserve. 


5.5.1.2 Reweighting matrix elements 


The idea underlying actual algorithms can easily be understood going back to Sec- 
tion 2.3.3, where k,-jet rates in electron—positron annihilations to hadrons were ana- 
lytically resummed, cf. Eqs. (2.184) and (2.186). With the interpretation of Sudakov 
form factors as no-emission probabilities, the two- and three-jet rates, i.e. the proba- 
bility for emitting no or only one jet, can be approximated, at logarithmic accuracy, 
as products of such no-emission probabilities and the terms relevant for the emission 
of a single parton, 


Ro (Qeut) = [Aq(ua, 2o 
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The integrated splitting kernels T4, have already been introduced in Eq. (2.181), 
leading to Sudakov form factors Aq,g, which, depending on whether a, is taken as 
fixed or running assume the form of Eq. (2.182) or Eq. (2.183). The upper limit ug of 
the Sudakov form factors, and correspondingly of the integration over the transverse 
momentum of the emitted gluon, is usually identified with the only hard scale of the 
process, the centre-of-mass energy Eems of the e~e* pair. Since Ro and Rz are jet 
rates, they are normalized to the total hadron production cross-section. Therefore the 
production of the original quark—anti-quark pair in the electron—positron annihilation, 
proceeding through electroweak interactions, has been factored out. The relationship 
of the corresponding approximate cross-sections at O (as) is given by 


ro 2 2 
Ps (Qeut) = 1-He(Qeut) = / dgi PES rub, @)| + O(a?) . (5.340) 


Remembering that I’, is nothing but the integrated splitting kernel it becomes quickly 
apparent that this expression indeed is the O(a,)—approximation of a k; -ordered par- 
ton shower to the respective cross-section. 1? 

In the parton shower language, Eq. (5.340) can be cast into 
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In order to improve this description of the three-jet rate as provided by the par- 
ton shower with exact tree-level matrix elements, the expressions in the integral of 
Eqs. (5.340) and (5.341) merely have to be replaced with the exact matrix elements. 
Choosing the scale of ag in the matrix element as in the parton shower also includes 
the corresponding logarithmic terms. This idea has already been encountered in Sec- 
tion 5.4.2, where matrix-element corrections to the hardest emission were discussed. 
Using them, the exact radiation pattern up to O(a,) was recovered by reweighting the 
parton shower with the fixed-order matrix element at the same perturbative order. 

In contrast, multijet merging algorithms start from a matrix element, with an 
inverted logic: instead of the parton shower being reweighted with the matrix elements, 
the matrix elements are modified by terms stemming from the resummed expression. 
This is achieved through a combination of adjusted scales of a, and reweighting with 
the Sudakov suppression factors which have been omitted in the fixed-order expressions 
in Eqs. (5.340) and (5.341). As a result, the procedure resums the same logarithms 
as the parton shower does, but, in addition, reproduces the exact fixed-order result 
provided by the tree-level matrix elements. 

The remaining problem is to determine the adjusted scales for both the a, and the 


10The Sudakov suppression factors, being exponentials, could in principle be Taylor-expanded to 
approximate higher order terms. While this is not relevant for merging technology at leading order, 
it becomes relevant for fixed order matrix elements beyond tree-level, see Sections 5.5.3 and 5.6.1. 
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Sudakov reweighting. To this end, a parton-shower emission history is constructed 
from the matrix-element configuration by a recursive algorithm. In each recursion step 
r, all allowed parton pairs i and j, which may be combined into a joint emitter ij, 
together with all possible spectators k are considered. The respective flavours and 
momenta define a splitting kernel Kj; ;, and the corresponding kinematics. Following 
the logic of parton showers, the latter is characterized through the ordering parameter 
t), the splitting variable z”) and the azimuth angle °). From them, the nodal scale 
value po entering the strong coupling is readily read off. The momenta after undoing 
the splitting ij — i + j are given by the inverse of the parton shower phase-space 
map. Repeating this procedure until a core process — typically a 2 > 2 scattering 
process — is reached, thus yields a sequence of splitting kernels and nodal values for 
the coupling constants. 

There are two ways of selecting the actual parton shower history from such a 
procedure: Either in a “winner-takes-all” strategy, where step-by-step only the most 
probable clustering, typically the one with the smallest transverse momentum, is re- 
tained, or in a probabilistic strategy where the “winning” parton history is selected 
according to overall weights given by the product of the respective splitting kernels. 

In general, irrespective of such details of the final selection strategy, the construc- 
tion of the parton shower emission history leads to a sequence of m nodal values of 
hardness scales — typically transverse momenta — for the backwards clustering, 


Q™ < QD <... < QO < OM <Q = Q? (5.342) 


the respective values of the parton shower evolution parameter 


pm aD a a L a AD ONS) eo (5.343) 


and, correspondingly, for the scales entering the strong couplings 


ue < < p» TLE uP < uP < ion i (5.344) 


In the discussion up to now, the values of t have been identified with the corresponding 
Q?, which would usually be the case; it is worthwhile to mention, however, that for 
instance for angular ordered showers this is not quite the case. There an ordering in 
hardness/transverse momentum Q does not usually also manifest itself in an ordering 
in the emission angles. As in the case of next-to-leading order matching, truncated 
showering as defined in [782], must be applied in such circumstances, see below for 
a more detailed discussion of its effect. But, indeed, there are some further subtleties, 
which, as already hinted at, most notably concern strategies for dealing with unordered 
emissions. Such rather pathological cases will not be discussed here. 

Nevertheless, with the information encoded in the parton history at hand, the 
overall scale ur of a matrix-element configuration is determined as 


ZII (ub) = aM (uh ore) LI osha (5.345) 


rem 
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where M is the power of ag in the hard core process, ylor*) is the scale choice associated 


to this process, and where the pO are the scales of the m QCD splitting nodes. In 
a similar way, the evolution scales t@ and the core scale (°°°) define the Sudakov 


suppression factors. 


5.5.1.3 Interlude: A scale-setting prescription for LO and NLO 
calculations 


One of the reasons for the phenomenological success of multijet merging can be traced 
back to the fact that multi-parton cross-sections that have genuinely been calculated at 
leading order look surprisingly similar in shape to the corresponding next-to-leading 
order ones, if the former have been subjected to the scale-setting prescriptions de- 
scribed previously. Of course, this prescription simply generalizes to the case of NLO 
calculations, where, after ignoring the softest parton in the real emission contribution 
the overall scale can be deduced from Eq. (5.345) as well. These ideas have also been 
adopted in the MINLO framework [611], cf. Section 5.6.1. 


5.5.1.4 Vetoed parton showers 


Moving to the jet evolution part of the simulation, the interplay of the two expressions 
for the two- and three-jet rates, Eq. (5.339), will also fix the way the parton shower 
must supplement the radiation pattern below Qeut in such merged simulations. Simply 
put, the correct jet rates must be recovered at the same logarithmic accuracy for all 
values of a jet resolution QJ, above and below Qeut. 

The cross-sections for the various jet multiplicities have been made exclusive with 
respect to the emission of additional jets, by multiplying them with the Sudakov 
form factors. This is, yet again, a manifestation of the interpretation of the Sudakov 
form factors as encoding no-emission probabilities at their logarithmic accuracy. The 
modified cross-section expressions can thus be combined into an inclusive sample, 
where each jet multiplicity covered by the fixed-order matrix elements is described 
with differential and total cross-sections at tree-level accuracy with an improved scale 
choice. From the matrix element point of view, there is no double-counting present 
in such a sample.!! However, up to now each of the jets in this sample consists of 
exactly one parton only, and therefore must be further populated by invoking the 
parton shower. 

There are two caveats related to naively applying the parton shower. First, the 
parton shower is not allowed to produce any unwanted jets, in order not to spoil the 
description of jet rates provided by the reweighted matrix elements, which is accurate 
at fixed order (supplemented with a resummation of leading and, eventually, sub- 
leading logarithms). Naively, this could be achieved by choosing teut as the starting 
scale of the parton shower evolution of all external partons, thus explicitly disallowing 
all harder emissions, which typically would become additional jets. As will be seen, 


11A fixed-order version of this idea has been provided in the BLACKHAT +SHERPA framework, 
dubbed “exclusive sums”. There, NLO matrix elements for V+ jets production with increasing jet 
multiplicity are added, with the phase space for the real emission correction constrained to inner-jet 
radiation only, i.e. such that it does not produce additional jets; see also a short description in [134]. 
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this conflicts with the second potential problem of evolving the jets through the parton 
shower, namely the condition to maintain the intrinsic accuracy of the latter. 

To see how this works out, consider, as an example the emission of a parton off 
a two-parton configuration in e~et — jets. Such a configuration has been identified 
with a two-jet event, and thus contributes with a rate given by Re(Qeut) as given in 
Eq. (5.339). The event therefore already has been weighted with a Sudakov suppression 
factor 


HQ 
d sC 3 
RaQ) = [Aal Qu]? = exp -2 | Se SEE (toe re =) 


T 
gus = (5.346) 
&œs—> const. Qs CF 2 HQ 3 HQ Ji 
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x | T ( S Qeut 2 ° Qeut 


where, for simplicity, effects of the running of the strong coupling are ignored. What 
would the two-jet rate be for values QJ < Qeut? Applying the same logic, the result 
should be given by R2(Q_7). However, if the suppression factor above was combined 
with the corresponding Sudakov suppression in a parton shower starting at Qeut 


Ro (Qs) > [Aq(ua, aut) ` Aq( ve Q2)” 
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(5.347) 


results. This simple consideration shows that a naive treatment leads to an unwanted 
and unphysical dependence on Qeut in the two-jet rate at QJ. In addition, it is apparent 
that a parton shower starting at Qeut only yields very limited radiation just below this 
scale — in fact, for a scale q > Qeut the radiation vanishes completely. This of course 
would yield a completely unphysical radiation dip just below Qecut. Therefore, simply 
starting the parton shower at Qeut is not an option. 

The solution to this problem presents itself, when analysing the structure of emis- 
sions mediated by the parton shower. Starting the parton shower at uo and vetoing 
every emission above Qeut yields an expression that reads, for a single quark leg, 


HQ 


dq. as Cr 
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S TE rue a) 
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HQ HQ 


dq. as Cr 1 dq. as CF 
= 14 r H r fod 
fp Ennu] f BEF rune. a) + 
Qeut cut 
dq. Os Cr S 
= exp / a a Talno, a1)} = A7 ug, Qeut) » (5.348) 


thus compensating the Sudakov suppression on the matrix element and eliminating 
the dependence on Qeut for the two-jet rate at scales QJ < Qeut- This interplay of 
Sudakov rejection and vetoed parton showering was at the core of the proof of the 
logarithmic accuracy of multi-jet merging in [351]. The original proof has later been 
extended to also include initial state radiation, showing that a merging prescription 
can be formulated which exactly maintains the logarithmic accuracy of the parton 
shower, irrespective of implementation details [632]. 

In general this reasoning translates into an algorithm, where the parton shower 
evolution of each parton starts at the scale where it was first produced, as constructed 
from the parton history. This also gives rise to the Sudakov suppression weight on the 
matrix element. When running the parton shower, however, all emissions that would 
lead to the production of a jet above Qecut are vetoed. The corresponding algorithm 
is also known as vetoed parton shower. This compensates, by construction, the 
Qeut-dependence in the event inherited from the suppression applied on the matrix 
element. 

Two comments are in order here. First of all, while this compensation is, in prin- 
ciple, accurate at the logarithmic accuracy of the Sudakov form factors employed, 
there may be mismatches. This occurs if the analytic Sudakov form factors are not 
exactly reflected in the parton shower, which typically is the case. Reasons for this, 
of course, range from an ordering in an evolution parameter different from the trans- 
verse momentum used in the kų algorithm employed to construct the resummed jet 
rates above, over non-logarithmic contributions emerging from finite terms in the z— 
integral of the splitting functions, to the exact inclusion of recoil effects inside the 
parton shower. These mismatches, despite typically being of sub-leading logarithmic 
accuracy, may become numerically important and would then manifest themselves in 
observables such as differential jet rates or similar. 

In addition, when using parton showers with a sufficiently different ordering such 
as, for instance, angular ordering, it quite often happens that the first emissions are 
not the hardest ones according to the jet criterion. In such a case, vetoed showering 
alone will not be sufficient, and measures must be taken not to upset the logarithmic 
and colour structure produced by the parton shower. The solution to this problem 
of a mismatch of the parton shower evolution parameter and the hardness ordering 
consists of employing what is known as truncated showering [782]. In this formalism, 
the parton shower is allowed to emit partons at a transverse momentum below the 
relevant cut in hardness but with an evolution parameter above the one related to the 
hard emission providing this cut. In this way a radiation pattern is generated that is 
ordered in the evolution parameter of the parton shower but unordered in the hardness 
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Qo > Qeut QD < Qeut QHD a Qeut 
xO) = +1) > +2) 


Fig. 5.20 Sketch to illustrate the idea underlying truncated showering. 
While the hardness of an emission according to the jet definition is given 
by Q, the parton shower evolution parameter squared is denoted by t. 


parameter of the jet criterion. This is illustrated in Fig. 5.20. 


5.5.1.5 Compact description 


There remains a potential problem, namely a mismatch of analytic expressions em- 
ployed for the Sudakov suppression weight on the matrix elements and the Sudakov 
form factors employed in the parton shower. The former encompass logarithmically 
enhanced terms only, which emerge when integrating over the energy splitting param- 
eter z in its limits from 0 to 1. The latter, on the other hand also have finite terms of 
the type q?/Q? related to the limits in z, essentially structures that look like power 
corrections in the transverse momentum of the emission. 

This slight mismatch, however, can easily be overcome by directly using the parton 
shower to generate the Sudakov suppression terms of the matrix element. This was first 
noted [730] in the framework of multijet merging with the dipole shower implemented 
in ARIADNE [729]. The logic behind this is very simple: using that the Sudakov form 
factor in any case represents a no-emission probability, vetoing every event where the 
parton shower produces an unwanted emission with a transverse momentum k, > 
Qeut automatically generates the correct Sudakov suppression, thus playing the same 
role as the analytic weights. Of course, the initial parton configurations produced 
by the matrix elements still need to be clustered back to a core process in order to 
construct a parton shower history, providing the nodal scale values entering a; and 
yielding the starting conditions of the parton shower. 

This also enables a very compact way of analysing the structure of emissions in 
multijet merging. To see how this works in principle, consider first the case where 
matrix elements corresponding to the production of N and (N + 1) jets are merged 
into one inclusive sample. Up to the first emission of the N-parton configuration, the 
differential cross-section for the production of (N + 1) particles reads 


LN 
AW (u, te) + I de, Key AN? (u2, tn+41)O(Qeut — Qn 41) 
te 


+ d@n41 By 41 A Pic, tw+1)O(Qn+1 — Qeut) 


do = d®y By 


(5.349) 
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Here, Qn 41 is the hardness scale related to the emission of the (N + 1)th particle. As 
advertised before, this scale defines two different regimes, namely the parton shower 
region Qv+41 < Qeut and the matrix-element region Qv+1 > Qeut. While in the 
parton shower only expression, Eq. (5.304), such a differentiation is not being made, 
in contrast here the description of the emission of the (N + 1)th particle is susceptible 
to this difference. 

This of course has some consequences. Taking a closer look at the square bracket 
in the first line it becomes apparent that it does not integrate to unity any more, due 
to the phase-space constraint — encoded through O(Qeut — Qn+1) — present in the 
second term, the emission term. The missing hard emissions are of course supplied 
through the second line, explicit through the complementary constraint O(Qn+41 — 
Qeut). There are two slight mismatches which prevent this term to also account for 
the pieces lacking in the first line to integrate to unity. First of all, there is a mismatch 
in the form of the emission term: the exact matrix element for the (N + 1)-particle 
state is different from the exact matrix element for the N-particle state convoluted 
with the parton shower kernel. This does not come as a surprise, as this was the very 
reason multijet merging has been introduced in the first place. 

On top of this, the phase space available for this additional emission, the (N +1)th 
particle, differs in both lines. In the first line the upper limit on the available phase 
space is given by the relevant parton shower starting or resummation scale defined 
through the N particle kinematics, wy», while in the second line it is the potentially 
different (N + 1)-particle kinematics, which defines the corresponding scale pin +1. 
While in processes with a fixed hardest scale, such as electron—positron annihilations 
to jets it is safe to assume that typically uy and py +1 are very similar or even identical, 
this is not true for processes at hadron colliders such as the production of lepton pairs 
(the Drell-Yan process) in association with jets. 

There, the emission of additional jets typically may offer phase-space regions associ- 
ated with larger scales than processes without these additional jets — in the example 
of Drell-Yan-type processes this would correspond to jet emissions taking place at 
transverse momentum scales above the invariant mass of the lepton pair. Taken to- 
gether, this leads to the fact that the combined contributions from the hard and soft 
first emissions will not exactly combine with the no-emission term to yield unity. This 
has been, in a somewhat sloppy use of language been coined “unitarity violation”. 
It consequently leads to a variation of the total cross-section related to the inclusive 
sample produced with respect to the Born result. Postponing details to later parts 
of this section, it should be noted that this effect, while present in some of the more 
widely used multijet merging implementations, has been taken care of by the UMEPS 
algorithm [733, 806], which conserves inclusive cross-sections by suitably reshuffling 
different contributions. 

Taking into account first emissions only, but combining many Born matrix elements 
up to a maximal number Nmax of external legs promotes the simple expression of 
Eq. (5.349) to 
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Note that in the contribution from the Nmax configuration, the phase space filled by 
the parton shower is not constrained by the jet cut Qecut but by the jet measure of the 
last emission filled by the matrix element. This allows the parton shower to account 
— within its limitations — for even higher jet multiplicities. 

At this point it should of course be stressed that further emissions are based on 
splitting kernels supplemented with the phase-space veto: 


Ky (®1) ES Ke (®1) = Kw(®1)OQ - Qn). (5.351) 


Taking this into account and applying it to the subsequent emissions encoded in the 
parton shower evolution operator, cf. Eq. (5.310), Ew) becomes 


ENSO (He, te) 
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(5.352) 


Inserting this into the merging equation Eq. (5.350) above results in 
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This improved description has been implemented for a variety of parton showers, 
most of which use an ordering parameter related to the transverse momentum of 
the emissions and thus do not exhibit any real mismatch of evolution and hardness 
parameters. For a more in-depth discussion of the various realizations the reader is 
referred to the literature (632, 732]. Multijet merging with an angular-ordered parton 
shower in the framework of HERWIG++ has been explored in [612], where the impact 
of not employing truncated showering in the merging has been analysed, too. 


5.5.1.6 Alternative: The MLM method 


The MLM prescription for multijet merging [147, 744] was introduced and further 
refined in parallel to the method discussed so far. It is based on slightly different 
ideas: most notably it aims at using traditional parton shower routines like the ones 
implemented in PYTHIA [853] (virtuality or p,-ordered shower) or in HERWIG [414] 
(angular-ordered shower) which are accessed through a standardized interface between 
general matrix elements and parton showers [145] without any direct alteration in 
the code. The structure of the resulting algorithm implies that vetoes on unwanted 
emissions can only be applied a posteriori, i.e. after the parton shower evolution has 
finished. In practice this is realized by reclustering the partons after the parton shower, 
i.e. at the hadronization scale te, into jets and by comparing these jets with the original 
ones, from the parton level at scale Qeut. If an additional jet with respect to the original 
ones has been produced, or, conversely, if a jet has been lost, the event then is rejected. 
This algorithm of course incorporates a slightly different way of treating emissions off 
intermediate legs, which here are re-interpreted as radiation off external legs, by a 
suitable definition of parton shower starting conditions.'” 

In its original version, the MLM prescription uses a simple cone definition as the jet 
criterion to generate the parton configurations at the matrix-element level. The scales 
of strong couplings are reconstructed by using a backward clustering with a kų mea- 
sure. Then the accepted configurations are passed on to the parton-shower routines. 
They in turn typically reconstruct the parton-shower starting scales of a multi-parton 
configuration by directly inspecting colour connections, without any backward cluster- 
ing and therefore partially neglecting intermediate legs and the radiation originating 
from them. Having the starting conditions at hand, the parton shower is invoked with- 
out any constraint. After it has terminated at the hadronization scale te, the original 
partons stemming from the matrix element are exactly matched to the jets present at 
parton-shower level, again defined by the cone algorithm with parameters, which in 
principle may slightly differ from the ones applied at matrix-element level. If such a 


12In such a treatment, the full capture of truncated showering effects is not guaranteed, and as 
a consequence some residual sub-leading logarithmic terms may be left out [632], which would be 
present in the original parton shower algorithm. 


386 QCD to All Orders 


one-to-one match is not possible, either due to extra, unwanted jets being produced in 
the parton shower or due to “losing” jets in the parton shower, the event is rejected. 
This means that the Sudakov rejection factors that are applied locally, either through 
analytic Sudakov form factors or by using the shower, as explained above, are applied 
inclusively over the full parton configuration. As a by-product of this treatment, the 
MLM approach penalizes “losing” jets which is not the case in the original merging 
prescriptions. This algorithm has originally been implemented in ALPGEN [743, 744]; a 
variant of it using the Durham k_ -algorithm has later been provided in the MADGRAPH 
framework [146]. 

Despite the subtle differences in the different approaches, by far and large a good 
agreement of predictions obtained between both methods can be observed. Respective 
results have been reported for the case of W+jets production at the TEVATRON and 
the LHC for example in [147]. 


5.5.1.7 Extensions: Dealing with photons 


The extension of the merging algorithm to also include photons is fairly straightfor- 
ward: treating the emission of photons “democratically”, i.e. on the same footing as 
QCD emissions allows to embed them into a multijet merging such that both the 
number of QCD particles and photons will vary. Evaluating matrix elements at the 
tree-level with Nacp QCD particles and N, photons is not a problem and can be 
dealt with standard technology. There is also no difficulty in supplementing the jet 
definition encoded in QJ; and a corresponding cut Qcut with an isolation criterion Q} 
and a corresponding cut Qiso for the emission of photons. From the parton shower side, 
things are similarly trivial and basically amount to supplementing q —> qy splitting 
kernels, which typically can be obtained from the q —> qg ones by suitably replacing 
coupling factors. In principle it is also possible — and probably even desirable — to 
also include y > ff splittings and to have different infrared cut-offs for the QCD and 
the QED part of the parton shower, in other words to supplement t. with a te, (QED). 
The latter typically would be chosen on scales of the order of the 7° mass or similar, 
which is of course possible due to the absence of a Landau pole in QED in the soft 
regime. 

Of course, the same logic could also be applied to extend this treatment to, e.g., 
the emission of the weak vector bosons, the W= and Z bosons. There is one caveat, 
though, related to the fact that the coupling of, say, W bosons to fermions is chiral 
and therefore highly sensitive to the spin of the fermions and, consequently, introduces 
non-trivial spin correlations in the parton shower. Including them in a systematic 
way would imply that the easily implemented structure of more or less independent 
emissions in a simple probabilistic fashion would have to be augmented with spin- 
correlation matrices of the kind discussed in [825] or a treatment similar to the one 
discussed in [779]. However, for the emission of one boson only some matrix-element 
corrections could be applied which would by far and large capture such effects. This 
has been studied in more detail in the framework of the PYTHIA event generator [396]. 
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5.5.1.8 Aside: unitarization of multijet merging: UMEPS 


The idea underlying the “unitarization” of multijet merging is to conserve the fixed- 
order cross-section of the inclusive process with lowest multiplicity — in the MEPS 
master equation, Eq. (5.353), this would be the cross-section of the N-particle process, 


J do0) = J dönBy = i da (UMEPS) | (5.354) 


This is in principle achieved by shuffling contributions from higher multiplicities to 
lower ones, thus compensating for the mismatch of fixed-order real emission cross- 
sections and their parton-shower approximation above Qeut. Such a mismatch mani- 
fested itself in Eq. (5.349) as a difference between By+1 and By 8 Ky. 

To see in more detail how this works, consider first the expression for the Sudakov 
form factor, Eq. (5.251), which can be decomposed as 
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Using this in Eq. (5.349) allows to rewrite the first line as 
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Again, because of the identical kernels in the individual terms and the identical phase- 
space constraints, the square bracket integrates to unity. Using the probabilistic nature 
of the parton shower, the Sudakov form factor, being interpreted as the no-emission 
probability, can be recast as unity minus the emission probability. In other words, in 
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the integrated parton shower emission rate above Qeut is given by the second term. 
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The unitarization of the cross-section is now achieved by suitably replacing the 
kernel in the integrand of Eq. (5.357), 
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Merging now the first emission through the corresponding Born matrix element yields 
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Similar replacements are applied in Eq. (5.350) to all brackets encoding the parton 
shower evolution, with the exception of the Nmax-term. Consequently, all N-parton 
inclusive cross-sections are governed by the input cross-sections associated with the 
By. Some comments are in order here. First of all, it is possible that a parton shower 
cannot produce all particles or phase-space configurations entering these cross-sections. 
In such a case the corresponding fixed-order cross-sections must be regarded as genuine 
corrections to the parton shower and therefore they will be just added. This is also 
true for the impact of the phase-space constraints given by the © functions. It is 
not always guaranteed that integrating over the one-particle phase space will yield a 
lower-multiplicity state that passes the Qcut criterion — sometimes one loses more 
than one jet. Handling such contributions is ambiguous, since they could be regarded 
as genuine fixed-order corrections as discussed in [806] or as contributions to even 
lower multiplicity states as in [733]. 


5.5.2 Hybrids of merging and matching: MENLOPS 


In a first step towards a full multijet merging on the basis of NLO multijet matrix 
elements, and with NLO matching and LO merging methods well established it is worth 
discussing a hybrid between these two methods. By now this has been established 
as the MENLOPS method, and it has been introduced and implemented for various 
processes in [608, 629]. It combines matching, either with the POWHEG or the MCQNLO 
method, of QCD NLO matrix elements for the lowest-multiplicity final states with an 
LO multijet merging of higher-multiplicity matrix elements. This implies that the 
NLO matching has to be performed in such a way that the real-emission part of 
the NLO correction does not produce any additional jets; fully in line with the LO 
merging paradigm therefore the emission phase space has to be constrained through a 
jet measure QJ and a corresponding cut Qeut. This requirement is realized by suitably 
multiplying with a Sudakov form factor. In addition, the no-jet emission constraint 
must act on the phase space available for both the real-emission correction encoded 
through an exact matrix element and on the parton shower parts. 
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While this is, by far and large, not too hard to implement, there is another some- 
what more nagging problem. The jet-exclusive cross-section for the lowest-multiplicity 
final state will be modified by the local K-factor discussed in Sections 5.4.3 and 5.4.4, 
while the higher multiplicities will not be corrected and be at Born level only. This 
can lead to discontinuities in the radiation pattern, in particular of the first, hardest 
emission, and especially in those cases, where the K factors are fairly different from 
unity, like, for instance, in the case of Higgs production through gluon fusion. In order 
to remedy this situation, the higher-multiplicity part could be multiplied by an inter- 
polating K-factor, ky, capturing the NLO correction to the lowest multiplicity N. For 
example, for the MENLOPS method, where the NLO matching part is realized through 
the MC@NLO algorithm, this interpolating K-factor may be given by something like 


kn(®n41) = 


By (1 HN ) HN ; aa for soft emissions (5.360) 


Bn BN+1 l BN+1 1 for hard emissions. 


Such an interpolation works fairly well, since in the soft limit the real correction term 
and its subtraction become very similar and therefore the hard remainder is given by 


soft 


Hn = Ry- Sy #8 0, (5.361) 


while in the hard limit the subtraction term is not very prominent and therefore 


Hn = Ru -Sn S Ry = Be: (5.362) 


Concentrating again on contributions up to the first emissions only, the differential 
cross-section for such a sample is given by 


do = d®y O(Qn — Qeu) By 


un 
x aw (ux, teut) + | de, Ky AW (u2, tn41)O(Qeut — Oven) 


tout 


+d6y+11 O(Qn — Qeut)O(Qeut — Quai) HWA? (u3:41,tw4s) 


+ d®y41 kw (nt) O(N — Qeut) By yA? (uN twa) 


tN+1 


AO niis teut) + / d®, Kn AO) (wa, tn+2)O(Qeut — z) 


tout 


+ d@ny +42 kw (®n 42) O(QNn +2 — Qeut) Brash Ging tw2)AW (tntz, twat) 


ms as teat aie ecg 


(5.363) 


390 QCD to All Orders 


Here, the first three lines correspond to an MC@NLO simulation, where the phase 
space for the (N + 1)th particle is constrained such that it does not yield another jet 
— this is what the O-functions O(Qeut — Qn +1) are there for. In order to make the jet 
counting more explicit, another O-function, 0(Q Nv — Qeur) has been added to highlight 
that all other QCD particles in the N-particle Born-level final state should be jets. 
Similarly, the fourth and fifth lines encapsulate the first additional tree-level matrix 
element, merged into the sample, and supplemented with the interpolating K factor 
ky. Again, no further jet emission is allowed. This could now continue like indicated by 
the sixth line, to include higher and higher jet multiplicities. In any case, any further 
emissions, as discussed already for the multijet merging at leading order, cannot result 
in additional unwanted jets. 


5.5.3 Multijet merging at next-to-leading order: MEPSQ@NLO 
5.5.3.1 Basic idea 


The idea underlying multijet merging with next-to-leading order matrix elements, 
MEPS@NLO, is identical to multijet merging at leading order: towers of matrix elements 
with increasing jet multiplicities are combined into one inclusive sample in such a way 
that no double-counting occurs. Of course, as before, it is important to maintain the 
accuracy of both the matrix elements, i.e. the cross-sections related to the processes 
as well as the fixed-order accuracy of the first emission, and the parton shower, i.e. the 
resummation of leading and the next-to-leading logarithms encoded in the showering. 

This has been first achieved in [558, 631], where the first missing terms have ex- 
plicitly been shown to be of order a2L°/N? — the colour-suppressed sub-leading 
logarithms beyond shower accuracy. Alternative methods and implementations have 
been presented in [535], and in [737, 806]. The latter two are closely related to one 
another, and both guarantee the proper “unitarization” of the emissions in line with 
the leading-order treatment by the same authors [733, 806]. 


5.5.3.2 First emission(s), once more 


Following the original presentation of a multijet-merging method of next-to leading- 
order matrix elements in [558, 631], this MEPS@NLO method can be understood as 
a merging of individual MCQ@NLO simulations. Some additional terms need to be in- 
cluded, though, to guarantee the fixed-order and logarithmic correctness. Concen- 
trating first on the merging of matrix elements for the production of an N and an 
(N + 1)-particle state only, and taking into account the first emissions only, a naive 
addition of two MC@NLO simulations for both would yield the (wrong) cross-section 
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doers) = döy O(Qn — Qeut) By 
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BN 
AY (ur, te) + I d®, KAM (u2, tw41)O(Qeut — Oven) 


te 


+ dOy41 O(Qn — Qeut)O(Qeut — Qu) Hwa (2, tw 4) 


+ d®y41 O(Qna1 — Qeut) By yt 


x 


tN+1 
AW (tnn te) + ri d®, Erpat} verteea) 
te 


+ d@y42 O(Qn41 — Qeut) Hn AP (ud, DAR (tnt, tn+2). 
(5.364) 


The first three lines would correspond to an MC@NLO simulation, where, in complete 
agreement with Eq. (5.363), the phase space for the (N + 1)th particle is constrained 
such that it does not yield another jet, realized by O(Qceut —Qn41) in the two emission 
terms, soft and hard. The next three lines would then stand for the next MC@NLO 
simulation, for a process with one more particle in the final state. 

Naively, it seems as if there is no double counting present here. However, this is not 
entirely true. To see this, consider the emission term in the first, soft part of the lowest 
order MC@NLO simulation, the term KvyAY (2, tn41) in the first square bracket 
and the corresponding hard emission part in the line below. Closer inspection reveals 
that at first order in a,, there is some unwanted contribution to the phase space of 
the next MC@NLO simulation. It stems from the expansion of the Sudakov form factor 
accounting for no emissions harder than ty41 in the combined soft and hard radiation 
pattern, ranging over the full emission phase space from ty +1 up to wy: 


2 


HN 
dy f dd, ByKy + dnp Hn | O(Qeut — Qnn) AY? (uh, tr41) 
te 


HN 
= aby f dd, (By OKu + Hw) + O(a2)! O(Qeut — Qna) AW (utn) 
te 


uN 
dO By OOne—Onaay |1 — i d®Ky + O (a2) |. (5.365) 


tN+1 


The second part of the bracket in the last line therefore interferes with the emissions 
of the MC@NLO simulation of the incremented multiplicity. This double-counting of 
emissions must obviously be avoided. There are various way of achieving this, the 
simplest one is by adding the second term in the square bracket above to the (N +1)- 
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MC@NLO simulation: 
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+ d®y+2 O(Qn41 — Qeut)O(Qeut — QN+2) HN+1 


x A (NETS twa AW, (tw41,tn+a) - 
(5.366) 


As in the MENLOPS method described in Section 5.5.2, the first three lines merely 
describe an MC@NLO simulation of the lowest multiplicity sample with an additional 
jet veto applied to all emissions. In a similar way, the next lines describe an MCQNLO 
simulation for the next higher multiplicity, modified by the term in round brackets in 
the fourth line compensating for a double-counting of terms stemming from the lowest 
multiplicity MCQ@NLO. 

As already worked out, this term compensates contributions stemming from the 
Sudakov form factor encoding the veto of hard emissions from the lower multiplicity, 
the two first lines. Such a compensation must be introduced into the formalism in 
order to maintain the NLO accuracy of the overall procedure. The fact that this 
double-counting of NLO terms is introduced by the parton shower (or analytic Sudakov 
form factors encoding the jet veto) renders these contributions potentially hard to 
understand at first. Realizing, however, that Sudakov form factors encode some of 
the NLO corrections in a leading-logarithmic approximation, this should not come as 
a surprise. In fact, the treatment here actually is fully analogous to the one of the 
Sudakov form factor in its interaction with higher-order matrix elements, presented 
later in Section 5.6.1, and made explicit in Eq. (5.372) there. 

Naively, such terms also seem to be hard to implement when generating the Su- 
dakov rejection directly from the parton shower rather than analytically. This turns 
out not to be entirely true. In close analogy to the discussion of vetoed emissions, cf. 
Eq. (5.348), it can be seen that these terms correspond to a vetoed emission off the 
N-particle state only. Instead of vetoing an event when an unwanted emission takes 
place, it is merely the hard emission itself that should be vetoed. 
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Of course, the very same logic could be iterated to include even higher multiplicities. 
In addition, further leading-order matrix elements could be added following the same 
reasoning with an interpolating K-factor as in the MENLOPS algorithm. 


5.5.3.3 Scale setting in MEPS@NLO 


One remaining problem is the definition of scales, and in particular the nodal values 
for the individual emissions and, correspondingly, the renormalization scales of the 
strong coupling in the various terms. Simply put, the same logic as in the LO case, 
Section 5.5.1, will apply. Starting from Born-type configurations, the partons are clus- 
tered backwards, yielding, as before, a parton shower history for the configuration. Its 
nodal values, t; will be used in the Sudakov rejection factors, and, together with the 
rest of the kinematics of the ith splitting the corresponding renormalization scale uÊ 
can be deduced. As in Eq. (5.345), then the overall scale for a process with M powers 
of the strong coupling in the 2 — 2 core process and with m additional QCD emissions 
will be given as 


ag!" (We) = O8 (We) = as" (HR(core)) © [| oseko). (3960) 


This scale is also used for the strong coupling related to loop corrections or to the 
soft emissions below the jet-threshold encoded for instance in the terms Hy. They are 
effectively ignored in the construction of the parton shower history, which relies on 
tree-level configurations only. 

When performing a scale variation of the renormalization scale in order to assess 
the corresponding uncertainty, one must therefore modify the Born matrix element by 
a factor such that 


: a(i? Z 7 
au) — az (ik) (: — 28D) 5, tog ay | (5.368) 
i=1 R 


while all higher order terms are just evaluated with the coupling taken at the new 
scale ji7,. This is slightly different from the scale setting prescription in MINLO, where 
the value of a, chosen for the additional real or virtual correction is fixed to be the 
average of all other values of the strong coupling, cf. Eq. (5.375). 

In a similar way, a new factorization scale can be chosen for the matrix elements, 
but must be compensated for by a term of the form 


ere 
Z 


1 1 
Bn logy Si J EPa) ejha (=, ab) + > J EPa falh (2 jij) 
d=4,9 t, 


(5.369) 


5.6 NNLO and parton showers 


Building on the algorithms and technologies discussed so far, the first implementations 
of combinations of calculations at next-to-next-to leading order (NNLO) in the strong 
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coupling for simple processes — essentially the production of simple colour-singlet 
systems — and parton showers have emerged. The first of them, dubbed MINLO, 
builds on a scale-setting prescription similar to the one employed in MEPS@NLO; 
its alternative, UNNLOPS in turn employs technology imported from the UNLOPS 
approach to multijet merging of NLO-QCD matrix elements. Both will briefly be 
sketched in the remainder of this chapter. 


5.6.1 Multijet merging without scales: MINLO 
5.6.1.1 Underlying ideas 


The MINLO ideas have been worked out in [611] (MINLO-1) and [609] (MINLO-2). 

The first instance focuses on improving the behaviour of fixed-order calculations in 
the soft region of parton emission, where they usually show unphysical, i.e. diverging, 
behaviour. This is achieved by a combination of clever choices for the scale of the 
strong coupling and the modification of the matrix elements by Sudakov form factors, 
similar to the programme already introduced in multijet merging at leading order, 
cf. Section 5.5.1. The main complication, though, that needed to be addressed stems 
from the fact that in contrast to the leading-order matrix elements, on which the 
CKKW method [351], operated, MINLO-1 aims at modifying next-to-leading order 
matrix elements. 

MINLO-2 pushes the ideas in MINLO-1 one step further, by reweighting the ma- 
trix elements with Sudakov form factors not at next-to-leading logarithmic accuracy 
(NLL) but actually at the next-to-next-to-leading logarithmic accuracy (NNLL). This 
ingenious idea allows the transverse momentum of the softest parton of the Born- 
level configuration to include scales of the order the parton shower cut-off te, around 
1 GeV. In turn MINLO-2 feeds such configurations directly into the parton shower 
through the POWHEG method, thereby essentially taking into account NLO correc- 
tions for processes such as Drell-Yan or Higgs production in association with one 
potentially very soft parton in a full hadron-level simulation. 


5.6.1.2 MINLOo-1 


As already stated, the primary aim of MINLO-1 is to improve the behaviour of sin- 
gle fixed-order calculations at NLO accuracy in the strong coupling in the Sudakov 
region, without upsetting their formal NLO accuracy. This is an important step to- 
wards improved stability of NLO calculations where a heavy system such as, e.g., a 
gauge or a Higgs boson is produced in association with light jets. In such cases, the 
stability of the calculation suffers with decreasing transverse momenta of the heavy 
system, or, correspondingly, of the light jets, due to the emergence of increasingly 
large logarithms, which must be resummed to all orders. As already encountered in 
previous sections, a convenient way to achieve this stabilization is through inclusion of 
now familiar Sudakov form factors. It should thus not be a big surprise that successful 
algorithms achieving this aim, such as the MINLO-1 method, are further developing 
ideas imported from multijet merging, presented in Section 5.5.3, but usually without 
the additional intricacies related to the combination of various NLO calculations for 
increasing multiplicities. 
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There are two ingredients in MINLO-1, already familiar from multijet merging. 
First of all, the renormalization and factorization scales are chosen to capture better 
the interplay of various scales in the process, and, secondly, the introduction of Su- 
dakov weights which tame the instabilities emerging in the regime of small transverse 
momenta. This proceeds by constructing a parton shower history for the underlying 
Born-level configuration, omitting the softest emission in the real-emission contribu- 
tion. As before, a strong ordering of the N nodal values in the hardness scale for the 
emissions is assumed, 


Qo = Qi = Q2... > Qn-1 = Qn = Qot, (5.370) 


where Qo denotes the hardest scale of the core process. For instance, in the case of 
the production of a heavy singlet in association with a number of jets, Qo typically is 
the mass of the singlet. The weight corresponding to the individual phase-space point 
is multiplied with an overall suppression factor, given by a product of Sudakov form 
factors of all internal lines i of the Born configuration and a product of all Sudakov 
form factors of all outgoing partons k of the Born configuration, which have been 
produced through a splitting at the scale Qk, 


N 
= II Ai( AA ][4 k(Q?, ADE (5.371) 
i=1 k 


Here, the subscripts i and k in A; and Ay, denote the flavour of the internal and 
external lines and the analytic Sudakov form factors from Eq. (2.183) are employed. 
The two external partons emerging at oe last branching, i.e. at scale Qn, will be 
associated with a Sudakov factor of A, (Qu, Qu) = 1. 

In order to compensate for higher-order effects induced by the Sudakov form fac- 
tors, the Born-term is modified by a factor which captures the first order in the a,- 
expansion of the analytic Sudakov form factors above. For each of the Sudakov form 
factors, the corresponding first-order term in its expansion is denoted by A“). With 
this notation, the correction factor is given by 


1 — DL BOG rig) FRA (Qi, ae) 
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(5.372) 


where the T; are the integrated splitting functions introduced in Eq. (2.178), and 


OOS -f dai al Dlg, q2). (5.373) 
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Here, the first sum ranges over all internal lines 7, while the second sum includes all 
outgoing parton of the Born configuration. This factor is in complete analogy to the 
one in Eq. (5.365), see also Section 5.5.3. 

To see how the scale setting works out for the MINLO method, consider again a 
Born-level configuration with M powers of the strong coupling related to the core 
process and with m QCD emissions with corresponding scales upr- In tree-level 
multijet merging methods, the overall scale upr is implicitly given by 


aX +N (u2) = a (LR, (core)) II As(MR,(3)) » (5.374) 


cf. Eq. (5.345). 

It may seem natural to use this scale also as the argument of the strong coupling 
in the real or virtual correction, and in fact, this is the way multijet merging methods 
at NLO choose the scale, cf. Section 5.5.3. In slight contrast, in MINLO-1 the average 
value of Qs, 


1 
(M+N+1) _ 2 >D ; 
Qs za M4+n Mas(HR,(core)) T 2 As(UR,(i)) ’ (5.375) 


is employed for them; the rationale for this choice is detailed in the publication [611]. 


5.6.1.3 MINLO-2: first implementation of NNLOPS 


Up to now, the MINLO-2 method has been formulated for the production of singlet 
systems only, for reason that will become obvious. The idea in it is to basically feed 
the next-to-leading order expressions in MINLO-1 for the production of such a singlet 
system S' in association with a parton j into the POWHEG formalism, thereby providing 
the link to the parton shower. By allowing the parton j to become as soft or collinear 
as the parton shower cut-off te allows, the full one-parton emission phase space is filled 
at NLO accuracy, and the second parton is distributed according to LO accuracy. This 
is possible by reweighting the Born-level singlet plus jet configuration with a Sudakov 
form factor at O (a2) accuracy, i.e. including terms A> and B2 from Q--resummation, 
thereby compensating all dangerous logarithms of the low scale. The catch then is to 
reweight the emergent event sample to a suitable distribution of the singlet at NNLO 
accuracy, to achieve the overall NNLO+P%S accuracy. 

Taking as an illustrative example for this procedure the production of a Higgs bo- 
son in conjunction with a parton, the B term for H + j production can be written in 
a similar way as before, including the Born term plus the real and virtual corrections. 
Following the reasoning of MINLO-1 above, however, Sudakov form factors are invoked 
to account for higher-order corrections which become large for small transverse mo- 
menta, and their interplay with the genuine higher-order terms must be accounted for 
by subtracting out their respective first order expansion. Taken together, and making 
all factors of as explicit, 
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B(s) = as(m3) a5(q1) AG (mz, Q1) - fB) [1 - 2AM mi, @3)] 


+ a,(Q? ) Ps) + JRE x ®ı) \ , 
(5.376) 


where Q, is the transverse momentum of the Higgs boson (and therefore, at Born- 
level, the extra parton) with Q? > te. 

Integrating Eq. (5.376) over the full phase space of the Born-level H + j configura- 
tion will therefore, by construction, yield the cross-section for H +j at NLO accuracy, 
including all terms that are singular in the limit Q, — 0, giving rise to logarithms 
of Q,. All singular terms, formally up to O (a2) with respect to the inclusive Higgs 
boson production, can of course also be obtained through the Q7-resummation for- 
malism at NNLL accuracy, which recovers all these logarithms. These are all terms 
of the form a,L? and asL as well as a2L*, a2L?, a2L?, and a2L, where the short- 
hand notation L = log(Q?/Q? ) has been used. In Qr resummation the resummation 
scale Q typically is identified to be of order of the singlet mass, in this case therefore 
Q =O (mz). 

Therefore, including also the Aj and Bə terms into the reweighting Sudakov form 
factors above, and fixing the scale of as in the real and virtual contributions to Q? 
guarantees that the result not only is NLO-accurate for H + j, but also for inclusive 
H production. 

There is one minor problem remaining, though, namely that the Sudakov form 
factors and coefficients given in Section 5.2.1, Eq. (5.65), Eq. (5.66), and Eq. (5.72), 
are for Qr resummation in the conjugate b,-space. Here, however, the Sudakov form 
factors are meant to be directly applied in transverse momentum space, and therefore 
a translation between both must be applied. As noted in [609], this actually has been 
worked out in [505] and essentially leads to adding a term 

ABE = 4¢(3) Co 


2,b1 >q 


(5.377) 


with Als ) the corresponding soft coefficient, while the Ag remain unaltered. The B2 
coefficients in the case of resummation directly in Qr-space therefore are given by 


Bee S Bee AB (5.378) 


These terms are used in the analytic NNLL Sudakov form factors, multiplied to 
the matrix element, of the MINLO method such that 
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Bq) = Bı 


with the coefficients A from Eq. (5.65), Bı from Eq. (5.67), and the term Bz from 
Eq. (5.378) with the Bə term in b1 -space given in Eq. (5.72). Note that the latter also 
include finite loop-correction terms, making them process-dependent. 

Finally, with both the inclusive process — in the example, inclusive Higgs produc- 
tion — and the process with an additional jet — H+ jet production — NLO correct, 
a seamless merging of the two multiplicities has been successfully achieved. This hap- 
pened, and it is important to stress this, without any jet cut like in the usual multijet 
merging. Instead, by carefully adjusting Sudakov weights and scales in a, it was pos- 
sible to modify the H+ jet part in such a way that it automatically also accounts for 
the inclusive part. 

In [610] it was then further realized that this could be turned into a full parton- 
level simulation, but accurate not only at NLO for both H and H+ jet production, 
but in fact accurate at NNLO in the inclusive cross-section. In order to achieve this, 
the rapidity distribution of the Higgs boson in the seamless merged sample merely 
has to be reweighted to the NNLO distribution. The same algorithm was also applied 
in a follow-up study, concerning Drell-Yan production [652]. There, the reweighting 
proceeds in three dimensions, thereby capturing the dynamics of the lepton system. 


5.6.2 An alternative NNLO+PS implementation: UNNLOPS 


To see how the UNNLOPS method [633, 634] works, consider first the simpler NLO 
case, known as UNLOPS. The idea there is to decompose the radiation pattern of 
all emissions into one part, where no emission happens and a complementary part 
capturing all emissions, and in particular at least one. Omitting the by-now obvious 
arguments of phase space for the different fixed-order contributions, the expectation 
value of an observable O is given by 


(0) = [aes By- | aeyRw + [ave 1—An(ti, HO)| Rw (Gr) ? O(s) 
k 


te 


ag faor Ax (ti, HO) Rw Eiu O): 
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Here, B is the differential NLO cross-section for Born-level kinematics from Eq. (5.326), 
Bn(®z) = B(®z) + Vn (®p) + fa [Rs ® ®,) — Sn (Ëg ® ®,) n (5.381) 


The argument of the observable indicates the particle composition and phase space 
from which it is evaluated. In view of this, the first line contains the observable after 
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no emission has taken place — the first term is the fixed-order result for a Born-level 
configuration where all fixed-order parton emissions above te in the real correction have 
been subtracted, such that the one potentially remaining emission is below the parton 
shower cut-off. The second term in the first line refers to those events, which, starting 
from a real correction with a parton above the parton shower cut-off where vetoed 
due to an unwanted emission harder than tı. In this case the parton configuration 
is projected onto the underlying Born configuration. The second line refers to those 
events which starting from a real-emission configuration experienced a parton shower 
evolution without unwanted emissions. This is signalled by the parton shower evolution 
operator EM (t, te; O), defined in full analogy to the Eq. (5.310), 


EX) (t, te; O) = A% (t, te) O(®n) 


+/ d&, [Kn(1) AWE, t(®1)) @ EL (t(4)), te; O) 


While the expression in Eq. (5.381) captures all relevant terms to maintain fixed- 
order accuracy, it does not account for the dependence of the real-emission contribu- 
tion Ry on the renormalization and factorization scales. This can trivially remedied, 
however, by suitable weights, see also the original literature. 

The same logic has been extended to match the parton shower to NNLO calcu- 
lations for the production of colour singlets, performed with qr subtraction. There, 
contributions with exactly zero, exactly one, and more than one emissions above te 
have been identified, leading to a somewhat lengthy expression, for which we refer to 
the literature. 


6 
Parton Distribution Functions 


Parton Distribution Functions (PDFs) are a necessary ingredient in the calculation 
of particle cross-sections at collider experiments with hadron beams. 
In Chapter 2, the master formula Eq. (2.52) 


1 
o = S fact f fajm (as) Fora bs er) atan (ars He) (6.1) 
0 


a,b 


was introduced to describe the calculation of the production of n-parton final states 
in processes of the type hyhg —> n+ X with incoming hadrons hı and ha — usually 
protons. The calculation of such a hard (parton-level) cross-section within perturbative 
QCD relies on also using partons in the inital state and thus requires the knowledge 
of the distribution of partons 7 in the hadrons h. This is most often achieved by 
using a scheme called collinear factorization in which this distribution depends on the 
longitudinal momentum fraction x of the partons with respect to the hadron, taken 
at the factorization scale pip. The resulting distributions fijn(x, wr) are known 
as parton distribution functions (PDFs), and parameterize the transition of incident 
hadrons to incident partons, thereby absorbing all emissions at scales below pr, cf. 
Chapter 2. In turn, the PDFs obviously depend upon this measure of the hardness of 
the parton-—level interaction, which will be of the same order as the renormalization 
scale wr. As a consequence of the factorization of possible parton emissions into a 
soft and collinear part, below up, and a hard part, above up, the latter are explicitly 
described by the matrix element. Factorization theorems, proven for deep-—inelastic 
lepton—proton scattering and for Drell-Yan production of gauge bosons in hadron 
collisions, assert that the PDFs in collinear factorization are process—independent. 
There are however terms of higher dimension — so called higher—-twist contributions — 
that are process dependent. These terms are ignored throughout this book, since they 
are suppressed by factors of the type m*/. Note that in this chapter both yz and 
Lr will often be represented just as Q. 

Pictorially, as was shown in Fig. 2.5 (left), more of the quantum fluctuations can be 
resolved as the hardness scale, corresponding to the inverse of the time scale, increases. 
Low-scale processes can only resolve longer time intervals, and with increasing scale, 
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smaller time intervals are being probed, and more of the quantum fluctuations inside 
the hadron are resolved, which transfer momentum from high-z partons to low—x 
partons. Thus, at higher upr one expects to find more of the momentum of the proton 
given to gluons and sea quarks with relatively low values of the momentum fraction 
x, as the quantum fluctuations creating these partons can be resolved. Conversely, the 
population of partons containing a large fraction of the parent hadron’s momentum 
will decrease. 

The PDFs describing this dynamics are treated as being universal, i.e. they can 
be determined by one set of processes and used to predict the cross-section for any 
process, using the factorized form shown above. These parton distribution functions 
currently can not be calculated perturbatively; ultimately, however, it may become 
possible in the future to calculate these PDFs non-perturbatively, using lattice gauge 
theory [310]. On the other hand, the evolution of PDFs with Q? can be calculated 
with a perturbative treatment using the DGLAP evolution equations, as sketched 
in Section 2.1.3 and discussed in more detail in Section 6.1. 

PDFs have been determined by global fits to data from a plethora of different 
data. Modern global PDF fits use of the order of 3000 data points from a number 
of processes from fixed target experiments, the TEVATRON, HERA, and the LHC. In 
particular, data from deep-inelastic scattering (DIS), Drell-Yan (DY) and jet pro- 
duction processes, have played a dominant role in the past. With the advent of the 
LHC and its huge amount of data, also processes such as y+jet, W+jets, and heavy 
flavour production (including tt) have become increasingly important as the statistical 
and systematic errors of the data sets have improved [827]. Many of these processes 
have recently been calculated to NNLO, allowing the data to be used in PDF fits at 
that order. At the moment this endeavour is actively pursued by a number of differ- 
ent groups: ABM [136], CTEQ/CT [489, 551], HERAPDF [115, 413, 819], JR [649], 
MSTW/MMHT (614, 755] and NNPDF [192, 194], which provide semi-regular updates 
to their fits of parton distributions, when new data and/or theoretical developments 
become available. Similarities and differences in the fitting procedure performed by 
the different groups will be discussed in some detail in Section 6.2. 

The most commonly used are those PDFs which perform a global analysis on a 
broad variety of data using a variable flavour number scheme; the most recent updates 
(CT14, MMHT2014, NNPDF3.0) are given in [194, 489, 614]. The resulting PDFs 
are available at leading order (LO), next-to-leading order (NLO), and next-to-next-to- 
leading order (NNLO) in the strong coupling constant ag, depending on the order(s) at 
which the global PDF fits have been carried out. Some PDFs have also been produced 
at what has been termed modified leading order [715, 846] (LO* or similar), in an 
attempt to reduce some of the problems that result from the use of LO PDFs in parton 
shower Monte Carlo programs. These PDFs are no longer in wide use, however. The 
choices of parameterization for these PDFs are then discussed, along with the impact 
of the use of particular renormalization and factorization schemes. Given the wide 
kinematic coverage of the data, the parameterization of the PDFs must be flexible 
enough to describe the parton distributions over a wide range of x and Q?, and to not 
introduce artificial correlations between different x regions. Modern PDFs take into 
account the finite charm and bottom quark masses (using a variety of heavy quark 
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mass schemes) in their fits to data, particularly to data from deep-inelastic scattering. 
This has consequences for low-mass cross-sections at the LHC that will be examined. 
Technical aspects related to the different orders, schemes and parameterizations are 
highlighted in Section 6.2.2. 

There are two different classes of technique employed for the PDF determination: 
those based on the Hessian approach, and those using a Monte Carlo approach. The 
background behind both classes will be discussed in this chapter, and some of the actual 
procedures employed in the fitting of the PDFs and the determination of the respective 
errors will be highlighted. An important consequence of the method chosen is the way 
uncertainties in the fits are handled. These uncertainties also have implications on the 
accuracy of overall cross-section calculations. This is followed by a similar discussion 
of the choice of the value of a,(mz) in the global fits. The data in the global fits are 
mostly from strong interaction physics, and they are thus sensitive to the exact value 
of as(mz). As a consequence, the global fit itself can be used to determine it. Since 
Qs(mz) is a universal parameter, however, another approach is to assume the world 
average value, and to not let a, be a free parameter in the fit. Both approaches can be 
and have been used. Recently, most groups have provided PDFs at the world average 
value of as(mz), along with PDFs at alternate values of as(mz). Issues related to the 
fitting technology, impacting on estimates of the intrinisic uncertainties of PDFs and 
their impact on cross-sections are discussed in Section 6.3. 

The next section is devoted to a discussion of PDF correlations. The examination 
and understanding of correlations between two fitted PDFs, or between a PDF and 
a cross-section, or between two cross-sections is crucial to a detailed appreciation of 
PDFs at the LHC. In adition, such correlations can be used to decrease the PDF 
uncertainties for the ratios of such quantities at the LHC. In Section 6.4, the resultant 
PDFs at LO (modified LO), NLO, and NNLO are presented. Parton luminosities are 
defined and are shown for several important initial states at the LHC in Section 6.5. 
PDF uncertainties using inputs from several PDF groups are then defined, using the 
PDF4LHC accords. Correlations among LHC processes are examined using these PDF 
combinations, and finally, in Section 6.6 several useful PDF tools that are currently 
available are described. 


6.1 PDF evolution: the DGLAP equation revisited 
6.1.1 PDF evolution at leading order 


An essential feature of the PDFs is the fact that their evolution with energy can be 
calculated within the framework of perturbative QCD, as has already been outlined in 
Chapter 2. In that chapter the equation governing this evolution — the QCD DGLAP 
equation — was first introduced. At the leading order it reads, 


ð Ce ay 
ð log Q? Forh tt; Q’) 
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The leading order splitting functions Pe have previously been given in Eq. (2.33). 
They represent the kernel of this evolution equation and clearly couple different PDFs 
together. Note that the symmetries of QCD mean that, at this order, splitting functions 
involving anti-quarks are simply equal to their quark counterparts. Before turning to 
the form of the DGLAP equation and the splitting kernels at higher orders in the next 
section, the calculation of the kernels P;; at leading order will be sketched below. 


6.1.1.1 Factorization of initial state emissions in the collinear limit 


The starting point for the calculation of the splitting kernels is the collinear limit of 
particle emission off initial-state partons. In order to see how this works, assume the 
production of a system X with mass Mx in the scattering of two incident partons j 
and 7’ with momenta p and p’, 
8 8 

pt = 2 (1,0,0,1) and p” = YÊ ,0,0,-1), (6.3) 
where § = (p+ p’)?. The differential parton—level cross-section for this process, at 
leading order, is 

- (0) nl (0) n|? 


Assume now that parton j emits a parton k with momentum k into the final state, 
turning into a parton į with reduced four momentum p -— k, with (p — k)? < 0, which 
together with 7’ produces X. The momentum k is then given by 


k" = (1— x)p" + bp" + kf, (6.5) 


where kı is perpendicular to both p and p’. 8 is fixed by the on-shell condition for k, 
k? = 0, and hence 


2 


k 
k” = (1- x) p“ 4 a S pH +k. (6.6) 


Here k? denotes the positive square of the transverse momentum. Similarly, 


ki 


2 — k) = 
alt T 


; (6.7) 


showing that the propagator of i is proportional to k. As a consequence, the corre- 
sponding matrix element for the process under emission of k diverges with ky — 0. 
This is the collinear limit, where the transverse momentum of the emitted particle 
vanishes. This singular behaviour also triggers divergences in the phase space integra- 
tion over k. All these divergences however cancel, according to the Kinoshita—Lee- 
Nauenberg theorem, and as seen previously, when virtual corrections are added. 


404 Parton Distribution Functions 


To proceed, the phase space integral over particle k has to be cast into a form 
useful for further consideration. With a bit of algebra it is easy to show that 


atk ; ; X 
dadk? 
a e (6.8) 


Adding in the emission of the final state parton k, and ignoring all other possible 
emissions, yields a higher-order contribution to the cross-section of X production, and 
Eq. (6.4) becomes 


A 1 : 
65 4x(p, p') = Mi) x(P, p’) 


1 
1 dx 0) 2 
+ 35 ie dex laz fes] ME rik okx D P) - 
(6.9) 


The leading-order splitting functions PY a ) in collinear factorization are then 
defined as the part of the second line of the equation above that diverges in the 


collinear limit: 


2 


1 1 i 2 
PP () [Mix | = iga im, he LM? singe sex (Ps p| |. 
(6.10) 


The x in the denominator of the left-hand side of the equation arises from the fact 
that the matrix element for ij’ — X has an incoming flux given by 2x8 instead of the 
original 28. 


1-2 as 


x Qn 


6.1.1.2 Calculating the kernels: PW 


To calculate the splitting function PY, i and j are quark lines and k is a gluon. 
Clearly, gauge invariance of the overall matrix element for the associated production 
of X and the gluon k is only guaranteed if amplitudes for the emission of the gluon 
off all coloured lines are included. The question is whether these contributions exhibit 
collinear divergent behaviour. 

It can be shown that this is not the case when choosing a smart physical gauge 
where the gluon’s polarization vector e” is transverse to both p’ and k. To see this, 
an additional axis nı L ky with nî = 1 is introduced, which allows the explicit 
construction of the polarization vectors as 


2k 1 
t v2 L yH | ki Æ inf. (6.11) 
(1— x) V2kı 


In addition to the orthogonality requirements underlying their construction, 
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ep Sek = 0; (6.12) 


they satisfy : 
1 

e+- kı 2 and é+: p Di (6.13) 
While the square of the graph with the propagator of i boasts two propagators and is 
thus proportional to 1/k‘4, all other graphs have at best one of these propagators. And 
since in the squared amplitude there are two scalar products involving the polarization 
vector, the numerator of the squared amplitude is proportional to k4. This leaves no 
overall divergence in 1/k? the squared amplitude apart from the one stemming from 
the graph with the 7-propagator. 

Analysing the structure of the reduced amplitude Me 
its square can be written as 


ij ) x, it becomes clear that 


2 
IMO) x| = alep, A) bu M"] u(zp, à), (6.14) 


where the spinors refer to the intermediate quark i. Because it is a massless quark, 
and since incoming and outgoing spinors come with the same helicity, this is the only 
allowed structure. Decomposing M according to 


M” = ap" + bp" + mf (6.15) 


with mı a vector in the same transverse plane as kų and using the Dirac equatoin 
elimiates the term proportional to p. Also, since the helicities are the same, the spin- 
flip operator y -m vanishes when sandwiched between the two spinors. In the end, 
therefore 


[MO x|? = aap, X bubo] ulen, 2). (6.16) 
realizing that 
lū(zp, A) (y-p’) u(ap, A)|? = (2zpp')? = 278? (6.17) 
identifies b and therefore T 5 
M# = on ag (6.18) 


These identities can now be rolled out to the calculation of the emission matrix 
element in the collinear limit. Including factors of 1/2 and 1/N, for the average over 
incoming quark helicities and colours, 


(0 
AG eee Cs) p’) 


ge p-# [# 


g°Tr [ TT ' : r 
= sa iy or H-P G- HF] [MO x 


| 2 
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Here, the following identities have been used: 


k2 
2kp = —+ 
j l-z 
2kp' = ê(1 — x) 
$ kapl, + kup’ 
So REE = -gw J iy E (6.20) 
Inserting this into Eq. (6.10) yields 
l-ra 1 29°C 8BTAsCpr(1 +z?) 
Os POf) = E N E a Th ae 6.21 
s a (2) 1672 omer reas) 16722 ( ) 
and therefore 
l+e« 
P® (£) = Cp ee (6.22) 


as expected from Eq. (2.33). 


6.1.1.3 Virtual contributions: The “+”-function and all that 


Having arrived at the expression in Eq. (6.22) is not the end of the story, though. 
Taking a closer look at the expression for PO shows the by now well-anticipated 
divergent behaviour for x — 1, or, for vanishing gluon energies. In this limit of course 
also the transverse momentum vanishes, leading, ultimately, to the usual double soft 
and collinear divergence. Of course, these divergences will cancel, when taking into 
account the virtual contributions, a feature that consistently appeared throughout the 
book. Formalizing this idea results in 


1 
mes fe [Srp (x) |Ma; uxlap? + (1+V) |Ma; >x (£p)? 
0 


1 
= faf; SS PO (a) + 6(L—2)(1+V)] Maysx(e)P, (6.23) 
0 

where V is the virtual contribution at order a,. This contribution can be obtained by 

invoking a simple probabilistic idea, similar to what has been seen in the construction 


of parton showers. To order as, the probability of a gluon emission by the quark plus 
the probability of no emission must add to unity. Hence, 


fæ [St PM(e) + 61—2)(+V)] =1 (6.24) 
0 
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and therefore the virtual part must compensate the real emission encoded in the 
splitting function: 


1-6 
sad te Ss (1) 
V = F lim dz Pig (2) 
0 
C n 2 Cr (3 
AsCOF ,. AsUF 
= l dx | —— -— (1 = ~+2logd). (6.25 
Qn 5-0 z ( ( +2) On (5+ on on) 
This implies that the complete splitting function is given by 
: 142? 3 l 
Pal) = Cr lim | Tae O(1— zx -— ô) + (1-2) € | 2108) | (6.26) 


This looks quite cumbersome. A solution, however, presents itself by realizing that 
the “splitting functions” P;; actually are not really functions but in fact distributions 
that will always act on some other functions f(x) such as PDFs or similar. These 
functions are typically regular at x = 1 and it is therefore meaningful to define a 
way to regulate the divergent behaviour of the P;; at x = 1 independent of the limit- 
procedure above. Introducing the “+” prescription, applicable for distributions that 
diverge at x = 1, such as 1/(1 — x), 


= : iaa O0, cok 


l-r 1-2 


cf. Eq. (2.10). This prescription will take care of the log d-term, located at z = 1 by 
virtue of the 6-function and therefore, finally, 


1+2? 3 
Pal) = Cr (= X H zQ ») ; (6.28) 
This actually implies that the x—integral over Pj; vanishes exactly, 
1 
J de P, q(t) = 0, (6.29) 
0 


thereby incorporating the exact cancellation of real and virtual contributions. 


6.1.2 PDF evolution at higher orders 


PDF evolution at the next order in perturbation theory is achieved by expanding the 
splitting functions as a series in the strong coupling, 


pl) + O(a?) (6.30) 


— pi) 
Pela) = Pi + on 
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Fig. 6.1 Processes corresponding to the NLO splitting functions Pia 


(left) and PE (right). Unobserved partons are indicated in the figure by 
parentheses. 


The inclusion of higher-order terms, p. introduces several subtleties that did not 
arise at leading order. The first is that, at leading order, the flavour structure of 
the evolution is trivial: for two flavours of quark, q; and q,;, the splitting function 
trivially vanishes unless i = j, i.e. PH, x dij. This is no longer true at the next order, 
as indicated in Fig. 6.1 (a). Contributions originating from the splitting of a virtual 
gluon into a quark-antiquark pair of different flavour, q; > q; (q4;), give rise to kernels 
that are no longer diagonal in flavour space. Moreover, the same process also gives rise 
to kernels that directly couple quarks and antiquarks, q; > @;(qiq;), cf. Fig. 6.1 (b). 

Therefore Eq. (6.2) must be generalized in order to account for effects at higher 
orders. The evolution can be written as, 


Saint, Q’) 
o 
aG, Q?) 


l 2 
ER fonha, Q’) 6 31) 
as(Q?) i dz Pasar ty Paid 2) Pag z) lentil? Q”) 
= on J z | Pro (2) Pua (2) Pas (2) | | fant Q) J 


z Poar (2) Pode (=) Pog (2) fajn(2, Q?) 


with all of the distributions coupled together. The presence of some amount of anti- 
quarks in the parent quark means that there is no physically-meaningful separation 
into “valence” and “sea” contributions beyond leading order. However it is possible to 
identify combinations of quark PDFs that can be identified with the valence contribu- 
tion as follows. The nature of the leading order splitting functions suggests that they 
be decomposed according to, 

Pua = PY + PH 


Pad; = 5ijP ig PÈ. (6.32) 


In order to solve the coupled evolution equations, Eq. (6.31), it is useful to introduce 
the following combinations of the splitting functions [550], 


Pa) = Ph EPK 


qq? 
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Pag = 2nfPag 
Pog = Poa» (6.33) 


for ny flavours of light quarks. Similarly it is convenient to express the parton distri- 
butions in terms of the following quantities, 


fy jp, Q’) = fa ph(2, Q?) an fagnl2, Q’) , (6.34) 
nf 

fa n(2@, Q?) = 5 FE m Q’) (6.35) 
t=1 


that is, into either a sum or difference of quark and antiquark distributions. The 
evolution expressed in Eq. (6.31) can now be written in a much simpler form in which 
the components decouple significantly. Two combinations satisfy a particularly simple 
form of the evolution, 


ð D as(Q”) dz 2 
log Q? fy x; Q*) = ar, FZ POF Ome Q“) (6.36) 
0 l ON as(Q?) dz , 2 
log Q? Si(z, Q ) ae 7 Psie, Q ) , (6.37) 
where the quantity S;(x, Q?) is given by, 
Si(z, Q’) = ng fy jn (@ Q’) = fo sn(2, Q’) . (6.38) 


These evolution equations can be solved separately; the solutions for fo MAGZ Q?) 


and S(x, Q?) are referred to as non-singlet contributions. The remaining quark 
singlet combination, fo /p(z, Q?), remains coupled to the gluon contributions, but 
in a fashion akin to the leading order evolution (cf. Eq. (6.2)), 


0 Gey a) 
Dlog Q2 \ falz, Q?) 


_ %(Q’) j dz ee, ) Pas R) o one 


27 z \Pa (2) Pag ( ) fajn(z, Q?) 


(6.39) 


Eqs. (6.36), (6.37), and (6.39) are sufficient to determine the evolution of all of the 
individual PDFs at a given order, if the corresponding splitting functions are known 
to the same accuracy. 

The first corrections to the leading-order splitting functions were originally com- 
puted in Refs. [421, 549]. Their forms are reproduced here for completeness. The two- 
loop gluon splitting function receives contributions from two different colour structures, 
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P (2) = CaP (2) + TrngPon(z) (6.40) 
that are given by, 
P®,(2)= 14 [pw 6(1—2z)(Ca(-143 
oga (2) = 8 Ca gg (z) aie ( —z)[ At + ¢3) + Bo] 
1 1 
(1) = = S Gora +) 2 
+ [P$ o. ( 2ln(1 — z) + 5 nz) ln z + E ( 2], (S2(2) + 3 ln z) 
2. 4(9+112°) 277 7 277 > 
+C4 (40+ 2)1m z 3 Inz igaz 7 1902 z)t+ ig” 
| 13 3 13 2 j f 
H Bl 5 (1 z) a? T (14 z)Inz), (6.41) 


4 
PE(z) = Cr| §(1-2)+ = —-16+82+ 527-214 z)In? z — 2(3 + 5z) Inz], 


where the plus-part of the first-order splitting functions has been defined previously in 
Eq. (2.33). This equation also introduces both the two-loop cusp anomalous dimension, 


Tı = $ (Ca(4— 1”) +580) (6.42) 


as well as the function, 


So(z) = —2 Lig(—z) — 2ln(1 + z) ln z — a i (6.43) 


The corresponding result for PÈ (2) is, 


PĒ (2) = caf PING) [mq z) —2In(1—z)Inz ~ T + PO) (—z)$9(z) 
Op (22m0 E E NEn pee igi = = =)} 
= Ce} PAM In?(1 — z) + BP (2) + 22Cr] In(1 — 2) 
Cr( 757 inte tt inet eh 
fs Bof PW (z) [ma —z)+ 4 + z} . (6.44) 


The quark splitting functions employ the decomposition of Eq. (6.32). The com- 
ponents are, 


Ty (1 + A) 


(2) eel 
Piv (2) = 8 Geri = z)4 
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+ô(1— 2)CF ae z is + 663) +0a(5 — 36s) + Bo(5 $ T) 


2 1 
Ge 2A 2In(1 — z) 4 S| mz+ 12 ty Sa TE 
l-z 2 2 
11 2 
+ CACr $ — mn? z+ (1+ 2)nz +30- 9] 
11+2? 
+60) 5 i =Ine+1- 2], 
(2) URE lnl + (1+2)Inz+2(1 
Prav (2) = (2Cr — Ca)Cry = [Sa(z) + 5m? 2] + (1+ 2)Inz+ 201-2) p, 


2 56 
pe) (z) = TRCF |-a +z) ln? z + (1 + 52+ D lnz + Ta 2+ 6z -— 7 ; 


qq5S 9z 
(6.45) 
and 
D(z) = : Gh gi ea + 2In(1 
Pi (z) = CrTp| (z + (1 z)?) (In 7 n— +5) + 2In(1 — z) 
L nz inet 2-52] 
22 1 a 
+CaTe{ (2 +0 2)*) | In?(1 — z) + 2In(1 — z) + 3 In z + 
+ (2? + (1+ z)”) So(z) — 2In(1 — z) — (1 + 2z) In? z 
68z — 19 20 91 7 
peoa iE, a paiak A 
— m+ + il (6.46) 


The solution of the PDF evolution equations given earlier, in the presence of these 
splitting functions, is obtained numerically. Some examples of the effects of DGLAP 
evolution on the PDFs will be given later in this chapter. 


6.2 Fitting parton distribution functions 
6.2.1 Processes involved in global analysis fits 
6.2.1.1 Interlude: Deep-inelastic scattering 


The parton distribution functions were introduced in Chapter 2 with reference to 
deep-inelastic scattering (DIS) of leptons and protons. These processes play an 
important role in the extraction of PDFs since, within the parton model, the PDFs 
are related in a fairly straightforward manner to cross-sections that can be measured 
experimentally. The kinematics of a DIS process can be described by the following 
variables: Q?, the square of the four-momentum transferred to the proton in the ex- 
change; x, the fraction of the proton’s momentum carried by the struck quark, y, the 
fraction of the incident lepton’s energy, in the proton’s rest frame, that is lost in the 
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collision. In terms of these variables the doubly-differential DIS cross-section can be 
written as, 


d?o O 2ra? 


T [(1+(-y)?) F- (1-(1-—y)}?) chs -y Fr] . (6.47) 


This introduces a parameterization of the cross-section in terms of the structure 
functions F>, F}, and Fr. This formula is correct for neutral current (NC) pro- 
cesses where the exchanged particle is a photon or a Z-boson. For charged current 
(CC) interactions the particle exchanged is a W-boson and, for a beam of the appro- 
priate helicity lepton to facilate the weak interaction, Eq. (6.47) must be multiplied 
by an additional coupling and propagator factor, 


2 
2 (Spri Q? Je (6.48) 


4ra Q?+mi, 


In the quark-parton model the relationship between the structure functions and 
the PDFs is: 


FXO =a 5° Cy [ fal, Q?) + fale, Q”)] (6.49) 
FSC = 3c [fa(x, Q’) — fa(z, Q?)] ’ (6.50) 


FOO = 2a [ful Q?) + fale, Q?) + fala, Q?) + fela, Q?) +...) (6.51) 
FeO) =2[fu(w,Q?) — fale, Q?) — Fle, Q?) + fel,Q?) +--] - (6.52) 


Here, Cy and Cj, represent the combinations of couplings and propagators necessary 
to account for the separate photon and Z-boson contributions to the neutral current 
process. In particular, F3 would be zero for the case of pure photon exchange. The 
charged-current process corresponds to the case of an incident electron, so that a W7 
is exchanged, with the corresponding result for Wt obtained by interchanging d © u 
and s + c. Since the contribution of sea quarks and anti-quarks is equal, measurements 
of FNC are particularly useful probes of the valence quark distributions. 


6.2.1.2 Interlude: End 


Measurements of deep-inelastic scattering (DIS) structure functions (F2, F3), or of the 
related cross-sections, in lepton-hadron scattering and of lepton pair production cross- 
sections in hadron-hadron collisions provide the main source of information on quark 
distributions f,/,(2, 7) inside hadrons. For simplicity, for the rest of this chapter, the 
scale up will be replaced by the scale Q, representing both the renormalization and 
factorization scales. 

At leading-order, the gluon distribution function f,/p)(z,Q?) enters directly in 
hadron-hadron scattering processes with jet final states. Modern global parton dis- 
tribution fits are carried out to NLO and NNLO, which allows as(Q°), fq/p(x, Q?) 
and fy/,(«,Q?) to all mix and contribute in the theoretical formulae for all processes. 
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Nevertheless, the broad picture described above still holds to some degree in global 
PDF analyses. Again, Q here refers to a scale representing the hardness of the inter- 
action, as for example the jet transverse momentum in a calculation of the inclusive 
jet cross-section. 

A NLO (NNLO) global PDF fit requires thousands of iterations and thus thousands 
of estimates of NLO (NNLO) matrix elements. The NLO (NNLO) matrix elements 
require too much time for evaluation to be used directly in global fits. Previously, a K- 
factor (NLO/LO or NNLO/LO) was calculated for each data point used in the global 
fit, and the LO matrix element (which can be calculated very quickly) was changed in 
the global fit (multiplied by the K-factor). Currently, a routine such as fastNLO [686] 
or Applgrid [329] is often used for fast evaluation of the NLO matrix element with 
the new iterated PDF. Practically speaking, both provide the same order of accuracy. 
Even when fastNLO or Applgrid is used at NLO, a K-factor approach (NNLO/NLO) 
is still needed at NNLO, at least until the fastNLO/Applgrid technique can be adapted 
for NNLO calculations (in progress at the completion of this book). 

The data from DIS, DY, and jet processes utilized in PDF fits cover a wide range 
in x and Q?. HERA data [77, 115] are predominantly at low x, while the fixed target 
DIS [169, 170, 217, 900] and DY [769, 874] data are at higher x. Collider jet data at 
both the TEVATRON and LHC [9, 61, 86, 103, 116, 117, 119, 126, 173, 360] cover a broad 
range in x and Q? by themselves and are particularly important in the determination 
of the high x gluon distribution. Jet data from the LHC have now been used in global 
PDF fits, and their importance will increase as high statistics data, and their detailed 
systematic error information, are published. In addition, jet production data from 
HERA have been used in the HERAPDF global PDF fits [78, 79, 389, 390]. 

As an example, the kinematic coverage of the data used in the NNPDF2.3 fit [192] 
is shown in Fig. 6.2. 

There is a tradeoff between the size and the consistency of a data set used in a 
global PDF fit, in that a wider data set contains more information, but information 
coming from different experiments may be partially inconsistent. Most of the fixed 
target data have been taken on nuclear targets and suffer from uncertainties in the 
nuclear corrections that must be made [693]. This is unfortunate as it is the neutrino 
fixed target data that provide most of the quark flavour differentiation, for example 
between up, down, and strange quarks. As LHC collider data become more copious, 
it may be possible to reduce the reliance on fixed target nuclear data. For example, 
the rapidity distributions for Wt, W7, and Z production at the LHC (as well as the 
TEVATRON) are proving to be very useful in constraining u and d valence and sea 
quarks, as described in Chapter 9. 

There is considerable overlap, however, for the kinematic coverage among the 
datasets with the degree of overlap increasing with time as the full statistics of the 
HERA experiments have been published. Parton distributions determined at a given 
x and Q? ‘feed-down’ or evolve to lower x values at higher Q? values, as discussed 
in Chapter 2. DGLAP-based NLO and NNLO pQCD should provide an accurate de- 
scription of the data (and of the evolution of the parton distributions) over the entire 
kinematic range present in current global fits. At very low x and Q?, DGLAP evolu- 
tion is believed to be no longer applicable and a BFKL [189, 517, 708, 709] description 
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Fig. 6.2 The kinematical coverage in x and Q? for the data sets in- 
cluded in the NNPDF2.3 global PDF fit. Reprinted with permission from 
Ref. [192]. 


should be used. No clear evidence of BFKL physics is seen in the current range of 
data; thus all global analyses use conventional DGLAP evolution of PDFs. 

There is a remarkable consistency between the data in the PDF fits and the pertur- 
bative QCD theory fit to them. The CT,MMHT, and NNPDF groups use over 3000 
data points in their global PDF analyses and the y?/DOF for the fit of theory to 
data is on the order of unity, for both the NLO and NNLO analyses. For most of the 
data points, the statistical errors are smaller than the systematic errors, so a proper 
treatment of the systematic errors and their bin-to-bin correlations is important. All 
modern day experiments provide the needed correlated systematic error information. 
The H1 and ZEUS experiments have combined the data from the two experiments 
from Run 1 at HERA (and now Run 2) in such a way as to reduce both the systematic 
and statistical errors, providing errors of both types of the order of a percent or less 
over much of the HERA kinematics [77]. In the Run 1 combination,for example, 1402 
data points are combined to form 742 cross-section measurements (including both neu- 
tral current and charged current cross-sections). The combined data sets, with their 
small statistical and systematic errors, form a very strong constraint for all modern 
global PDF fits. Thus, it can be hard for other data sets, for example from the LHC, 
to match the statistical and systematic errors of the HERA data. The manner of using 
the systematic errors in a global fit will be discussed later in Section 6.3. 

The accuracy of the extrapolation to higher Q? depends on the accuracy of the 
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original measurement, any uncertainty on ws(Q?) and the accuracy of the evolution 
code. Most global PDF analyses are carried out at NLO and NNLO. Both the NLO and 
the NNLO evolution codes have now been benchmarked against each other and found 
to be consistent [190, 191, 251, 262, 470, 572]. Most processes of interest have been 
calculated to NLO and there is the possibility, as discussed previously, of including data 
from these processes in global fits. Fewer processes have been calculated at NNLO [134, 
296]. The processes that have been calculated include DIS, DY, diphoton [340], tt 
production [427], and inclusive W,Z, and Higgs boson + jet production [269, 271, 272, 
274]. Late in the writing of this book, the complete NNLO inclusive jet production 
cross-section has been completed (a monumental feat), but the results are not yet in 
the form to be easily used in global PDF fits [422]. Typically, jet production has been 
included in global PDF fits using NLO matrix elements. Threshold corrections [445, 
676] can be used to make an approximate NNLO prediction, but the corrections are 
valid only over a limited phase space at the LHC, thus greatly reducing the size and 
power of the jet data in the global fits. Thus, any of the NNLO global PDF analyses 
discussed here are still approximate for this reason, but in practice the approximation 
should work reasonably well. The NNLO corrections for the inclusive jet cross-section 
have been found to be small (and relatively constant for the LHC phase space), if a scale 
equal to the transverse momentum of the jet is used [335]. The CT14, MMHT2014, 
and NNPDF3.0 PDFs follow different philosophies regarding the use of LHC jet data in 
NNLO fits. CT14 makes no cuts on the LHC jet data, MMHT2014 doesn’t include the 
LHC jet data, and NNPDF3.0 uses only the jet data for which threshold resummation 
provides a reasonable prediction. Current evolution programmes should be able to 
carry out the evolution using NLO and NNLO DGLAP to an accuracy of a few percent 
over the hadron collider kinematic range, except perhaps at very large and very small 
x. 

The kinematics appropriate for the production of a state of mass M and rapidity 
y at the LHC is shown in Fig. 6.3 [320]. For example, to produce a state of mass 
100 GeV and rapidity 2 requires partons of x values 0.05 and 0.001 at a Q? value of 
1 x 104 GeV?. Compare this figure to the scatterplot of the x and Q? range included 
in the recent NNPDF2.3 fit and it is clear that an extrapolation to higher Q? (M?) is 
required for predictions for many of the LHC processes of interest. As more Standard 
Model processes are included in global PDF fits, the need for extrapolation will be 
reduced. 


6.2.2 Parameterizations and schemes 


A global PDF analysis carried out at NLO or NNLO needs to be performed in a spe- 
cific renormalization and factorization scheme. The evolution kernels are calculated in 
a specific scheme and to maintain consistency, any hard scattering cross section calcu- 
lations used for the input processes or utilizing the resulting PDFs need to have been 
implemented in that same renormalization scheme. As we saw earlier in Chapter 2, 
one needs to specify a scheme or convention in subtracting the divergent terms from 
the PDFs. These divergent terms result from collinear gluon emission from the initial 


1The NNLO cross-section is higher than the NLO one at smaller jet transverse momenta if a scale 
equal to that of the largest transverse momentum jet in the event is used. 
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Fig. 6.3 A plot showing the x and Q? values needed for the colliding 
partons to produce a final state with mass M and rapidity y at the LHC 
(14 TeV). 


state partons. The collinear emissions result in pole terms of the form 1/e, where e 
is the dimensional regularization parameter. Basically the scheme definition specifies 
how much of the finite corrections to subtract along with the divergent pieces. Almost 
universally, the MS scheme is used; using dimensional regularization, in this scheme 
the pole terms and accompanying log 4r and Euler constant terms are subtracted.” 
PDFs are also available in the DIS scheme (where the full order a, corrections for Fz 


?Within the MS scheme, PDFs can also be defined for a fixed number of flavours, which then have 
a validity over the kinematic range for which that number (and only that number) of flavours can be 
present in the proton. 
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Fig. 6.4 CTEQ6.5 up and down quark distributions normalized to those 
of CTEQ6.1, showing the impact of the heavy quark mass corrections. 
Reprinted with permission from Ref. [876]. 


are absorbed into the quark PDFs). 

Basically all modern PDFs now incorporate a treatment of heavy quark effects in 
their fits, either via the ACOT general-mass (GM) variable flavour number scheme [131] 
(supplemented by by a unified treatment of both kinematical and dynamical effects 
using the S-ACOT [694] and ACOT-x [698, 875] concepts), used by CTEQ/CT, or by 
the Thorne-Roberts scheme [872, 873], used by both MSTW and HERAPDF, and the 
FONLL scheme, used by NNPDF [302, 532]. 

Incorporation of the full heavy-quark mass effects in the general-mass formalism 
suppresses the heavy flavour contributions to the DIS structure functions, especially 
at low x and Q?. In order for the theoretical calculations in the global fits to agree 
with the data in these kinematic regions, the contributions of the light quark and 
anti-quark PDFs must increase accordingly. This has a noticeable impact, especially 
on predictions for W and Z cross-sections at the LHC. 

Fig. 6.4 shows the impact of the heavy quark mass corrections on the up and down 
quark distributions for CTEQ6.5, at a Q value of 2 GeV [876]. The CTEQ6.5 up and 
down quark distributions are normalized to the corresponding ones from CTEQ6.1 
(which does not have the heavy quark mass corrections). The shaded areas indicate 
the CTEQ6.1 PDF uncertainty. The dashed curves represent slightly different param- 
eterizations for the CTEQ6.5 PDFs. The heavy quark mass corrections have a strong 
effect (larger than the PDF uncertainty for CTEQ6.1) at low zx, in a region sensitive 
to W and Z production at the LHC. 

The impact of general-mass variable flavour number schemes (GM-VFNS) lies 
mostly in the low x and Q? regions. Aside from modifications to the fits to the HERA 
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data, and the commensurate change in the fitted PDFs, there is basically no modifi- 
cation for predictions at high Q? at the LHC. 

It is also possible to use only leading-order matrix element calculations in the 
global fits which results in leading-order parton distribution functions, which have 
been made available, for example, by the CTEQ [489, 818], MSTW/MMHT (614, 
755] and NNPDF [191, 194] groups. For many hard matrix elements for processes 
used in the global analysis, there exist K factors significantly different from unity. 
Thus, one expects there to be noticeable differences between the LO and NLO parton 
distributions (and indeed this is often the case, especially at low x and high zx). 

Global analyses have traditionally used a generic form for the parameterization of 
both the quark and gluon distributions at some reference value Qo:? 


F(x, Qo) = 241 (1 — )4? P(x; Ag, A4...). (6.53) 


The reference value Qo is usually chosen in the range of 1-2 GeV. The parameter A; 
is associated with small-z Regge behaviour while Aə is associated with large-x valence 
counting rules. We expect A; to be approximately -1 for gluons and anti-quarks, and of 
the order of 1/2 for valence quarks, from the Regge arguments mentioned in Chapter 2. 
Counting rule arguments tell us that the Ag parameter should be related to 2n, — 1, 
where n, is the minimum number of spectator quarks. So, for valence quarks in a 
proton, there are two spectator quarks, and we expect A> = 3. For a gluon, there 
are three spectator quarks, and Az = 5; for anti-quarks in a proton, there are four 
spectator quarks, and thus A = 7. Such arguments are useful, for example in telling 
us that the gluon distribution should fall more rapidly with x than quark distributions, 
but it is not clear exactly at what value of Q that the arguments made above are valid. 

The first two factors, in general, are not sufficient to completely describe either 
quark or gluon distributions. The term P(x; A3,...) is a suitably chosen smooth func- 
tion, depending on one or more parameters, that adds more flexibility to the PDF 
parameterization. P(x; A3,...) is chosen so as to tend towards a constant for x ap- 
proaching either 0 or 1, so that the limiting behaviour is given by the first two terms. 

In general, both the number of free parameters and the functional form can have 
an influence on the global fit. A too-limited parameterization not only can lead to a 
worse description of the data, but also to PDFs in different kinematic regions being 
tied together not by the physics, but by the limitations of the parameterization. Note 
that the parameterization forms shown here imply that PDFs are positive-definite. As 
they are not physical objects by themselves, it is possible for them to be negative, 
especially at low Q?. Some PDF groups (such as CT) use a positive-definite form for 
the parameterization; others do not. For example, the MSTW2008 gluon distribution is 
negative for x < 0.0001, Q? = 2GeV?. Evolution quickly brings the gluon into positive 
territory. 

The CT14 fit uses 28 free parameters (many of the PDF parameters are either 
fixed at reasonable values, or are constrained by sum rules). There are a total of 8 free 


3Recently, there has been a trend towards the use of more sophisticated forms of parameterization 
in global fits, but the physics arguments listed here are still valid. For example, in the CT14 global 
fit [489], P(x) is defined by a fourth-order polynomial in yz; the polynomial is then re-expressed in 
terms of Bernstein polynomials in order to reduce correlations among the coefficients. 
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Fig. 6.5 The CT14 NNLO parton distribution functions evaluated at a 
Q? value of 2 GeV’, with a linear (left) and logarithmic (right) scale. 


parameters for the valence quarks, 5 for the gluon and 15 for the sea quarks. 

The MMHT2014 fit uses 20 free parameters, while the NNPDF fits effectively has 
259 free parameters. The NNPDF approach attempts to minimize the parameterization 
bias by exploring global fits using a large number of free parameters in a Monte Carlo 
approach. The general form for NNPDF can be written as fi(x, Qo) = ci(x) NN; (x), 
where NN;(x) is a neural network, and c;(x) is a “pre-processing function”. 

In the past, PDFs were often made available to the world in a form where the 
x and Q? dependence was parameterized. Now, almost universally, the PDFs for a 
given x and Q? range can be interpolated from a grid that is provided by the PDF 
groups, or the grid can be generated given the starting parameters for the PDFs (see 
the discussion on LHAPDF in Section 6.6). All techniques should provide an accuracy 
on the output PDF distributions on the order of a few percent or better. 

The parton distributions from the CT14 NNLO PDFs are plotted in Fig. 6.5 at a 
Q? value of 2 GeV? (near the starting point of evolution) and in Fig. 6.6 at a Q? value 
of 10000 GeV? (more typical of LHC processes). At the lower Q? value, the up quark 
and down quark distribution peak near x values of 1/3, the remnant of the spike in 
the primitive model described in Chapter 2. The charm PDF is very suppressed as 
it is produced entirely by evolution, and the Q? value is near the starting scale for 
the evolution. There is no bottom quark distribution since it is below threshold. At 
higher Q? values, the up and down quark peaks become shoulders due to the effects of 
evolution. At high Q?, the gluon distribution is dominant at x values of less than 0.1 
with the valence quark distributions dominant at higher x. One of the major influences 
of the HERA data has been to steepen the gluon distribution at low x. 

The CT14 up quark, up-bar quark, b-quark, and gluon distributions are shown as a 
function of Q? for x values of 0.001, 0.01, 0.1 and 0.3 in Figs. 6.7 and 6.8. At low x, the 
PDFs increase with Q?, while at higher x, the PDFs decrease with Q?. Both effects 
are due to DGLAP evolution, as discussed previously. An x value of approximately 
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Fig. 6.6 The CT14 NNLO parton distribution functions evaluated at a 
Q? of 10000 GeV?. 
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Fig. 6.7 The CT14 NNLO up quark, up-bar quark, c quark, and gluon 
parton distribution functions evaluated as a function of Q? at x values of 
0.001 (left) and 0.01 (right). 


0.1 is the pivot — point for gluon evolution; at this x value, the gluon distribution 
changes little as the value of Q? increases. It can also be seen that both the charm 
(and bottom) quark distributions are generated perturbatively from gluon splitting, 
and thus the distributions are zero below threshold for heavy quark pair production 
and rise rapidly thereafter with increasing Q?. It is also possible for there to be intrinsic 
charm, where charm quarks are present below threshold. However, no strong evidence 
has been observed to date for the existence of intrinsic charm. 


Fitting parton distribution functions 


421 


~ r o 1g 7 
g PDF a E 3 PDF 
= 1b =| -Up Rad E | —up 
x SETEN J x i X 7 
S ta abaeanerPeoestaket a ati J —gluon ee eee J “gluon 
E J] -upbar 10'E rerig “J --upbar 
[T “| - -charm E J --charm 
10°F = J 7 
E 3 O a N = 
102 pA r 1 1 1 ar 1 1 f 
10 10? 10° 104 i 10 10? 10° 10° 
Q?(GeV’) Q?(Gev") 
Fig. 6.8 The CT14 NNLO up quark, up-bar quark, c quark and gluon 
parton distribution functions evaluated as a function of Q? at an x value 
of 0.1 (left) and 0.3 (right). 
CT14 NNLO cT14 NNLO 
T T T T 0.50 T T 
0.04 
0.03 
A 
Y 0.02 


10° 10° 10 
Q (GeV) 


10! 10 10° ~—- 104 
Q (GeV) 


Fig. 6.9 The momentum fractions carried by the CT14 NNLO quark and 
gluon distributions, as a function of Q. The gluon distribution in the right 
figure is shown without (solid) and with (dotted) the presence of a top 
quark PDF. 


The average proton momentum carried by each parton species is shown in Fig. 6.9. 
As Q? increases, the momentum carried by up and down valence quarks decreases, 
while the momentum carried by the gluon and by sea quarks increases. For typical LHC 
hard-scattering scales, the gluon carries slightly less than 50% of the parent proton’s 
momentum. Note that a 5-flavour scheme is most commonly used, i.e. the charm 
and bottom (but not top) can appear as sea quarks once the Q value is sufficient 
to pair produce them from gluon splitting. It is also possible to allow top quarks in 
the sea in a 6-flavour scheme; even at the highest Q values, only about 1% of the 
proton’s momentum is carried by top quarks. The momentum added to the top quark 
distribution comes primarily from the gluon distribution. 

The photon is also a parton constituent of the proton, just as a quark or gluon 
is, and can be produced from QED radiation from quark lines [754]. This source of 
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photons is known as the inelastic component. There is also a (well-known) elastic com- 
ponent, which cannot be ignored, resulting from coherent electromagnetic radiation 
from the proton as a whole, leaving the proton intact [292, 587]. Both components 
contribute to photon-induced processes at the LHC. The inelastic component evolves 
with Q?, while the elastic component is relatively constant (varying with the running 
of the QED coupling). To include the photon PDF, the QCD evolution of the par- 
tons in the proton now has to be expanded to a QCD+QED evolution, where the 
QED aspect involves the electromagnetic coupling a(Q?) instead of a,(Q?) and the 
corresponding splitting function is Abelian rather than non-Abelian. At the LHC, 
especially for the 13-14 TeV running, processes involving photons in the initial state 
will become increasingly important, c.f. yy > WW, or photon-initiated production 
of WH. There is little hadronic data to directly constrain the photon PDF, but it 
has been known that less than 1% of the proton’s momentum is carried by photons. 
The first attempt (MRST2004QED) to model the inelastic component just considered 
photon emission from quark lines, using the known quark distributions, and using ei- 
ther the current quark mass (few MeV) or the constituent quark mass (few hundred 
MeV) as a cutoff [754] (see also Refs. [587, 772]). The disparity in the cutoffs leads to 
a wide range in the possible size of the photon PDF. Other attempts to determine the 
photon distribution used Drell-Yan data from the LHC [193, 194, 275](NNPDF2.3qed 
followed by NNPDF3.0qed) to fit for the photon PDF.* 

Another approach [80, 837] (CT14qed,CT14qed_inc) used data from the scattering 
process ep —> eyX measured by the ZEUS collaboration [391], leading to an upper 
constraint on the total inelastic photon PDF momentum, at a scale of 1.3 GeV, of 
approximately 0.14% (and a lower constraint of 0). This is to be compared to the total 
elastic photon PDF momentum fraction of 0.15%. The photon PDFs for the inelastic 
component only, assuming the maximum intrinsic momentum fraction of 0.14%, and 
the total photon PDF, including the elastic component as well, are shown in Fig. 6.10 
for Q=1.3 GeV (left) and Q=85 GeV (right). The dominance of the inelastic compo- 
nent at high Q can be observed. 

Recently, the photon PDF has been determined to a high precision (1-2%) using 
electron-proton scattering data, considering an equivalence between a template cross- 
section calculated either using proton structure functions, or a photon PDF [746]. 
The resulting PDF (LUXqed) is in good agreement with CT14qed_inc and is at the 
lower edge of the NNPDF2.3qed photon PDF uncertainty band at high xz, as shown 
in Fig. 6.11. 

The charm quark distribution also has a dynamic component, generated through 
gluon splitting into a cé@ pair. The photon/charm ratio increases with increasing Q 
and increasing x value. One reason for the variation in Q is that while a, decreases 
with Q, a remains approximately constant (actually, it rises slightly). At low æ, the 
photon/charm ratio is of the order of 5-10%, due to the difference in coupling constants 


4NNPDF3.0qed improves on the NNPDF2.3qed photon PDF with a correct treatment of a(asL)” 
terms in the evolution. These resulted in a large uncertainty for the photon PDF, as the errors in the 
reasonably precise high-mass Drell-Yan data are still large compared to the expected contributions 
from photon-initiated processes. 


°There may also be an intrinsic charm component at low Q, but the evidence is not convincing. 
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Fig. 6.10 The photon PDF from the inelastic component only, with a 
photon momentum fraction of 0.14%, and the total photon PDF, including 
the elastic component, at a Q value of 1.3 GeV (left) and 85 GeV (right). 
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Fig. 6.11 The ratio of various photon PDF sets to that of the LUXqed 
photon PDF set, all evaluated at a scale of 100 GeV. Note that the vertical 
axis scales are different for the sub-plots. Reprinted with permission from 
Ref. [746]. 


(as vs. œa) and the larger gluon than quark (primarily up quark) distribution at low 
x. (Remember that the photon does have a substantial elastic component at small x 
which is included in this ratio.) At high z, the dominance of the valence up quark 
over the gluon results in the photon distribution becoming larger than the charm 
distribution. 
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6.3 PDF uncertainties 


In addition to having the best estimates for the values of the PDFs in a given kine- 
matic range, it is also important to understand the allowed range of variation of the 
PDFs, i.e. their uncertainties. A prior conventional method of estimating parton dis- 
tribution uncertainties had been to compare different published parton distributions. 
Although in some cases, this procedure does provide useful information, in general this 
is unreliable since most published sets of parton distributions adopt similar assump- 
tions and the differences between the sets do not fully explore the uncertainties that 
actually exist. In addition, some PDF fits may use only a limited set of data in their 
fits, and they may result in PDFs that (a) have larger intrinsic uncertainties and (b) 
differ in their central values from PDF sets which use a more global set of input data. 
Some comparisons of PDFs and PDF predictions at the LHC will be given later in this 
chapter. 

The sum of the quark distributions (Ufyp(7, Q?) + fg/p(x, Q?)) is, in general, well 
determined over a wide range of x and Q?. As stated earlier, the quark distributions are 
predominantly determined by the DIS and DY data sets which have large statistics, and 
systematic errors in the few percent range (+3% for 1074 < x < 0.75). Thus the sum of 
the quark distributions is basically known to a similar accuracy. The individual quark 
flavours, though, may have a greater uncertainty than the sum. This can be important, 
for example, in predicting distributions that depend on specific quark flavours, like the 
W lepton asymmetry distribution and the W and Z rapidity distributions. 

The largest uncertainty of any parton distribution, however, is that on the gluon 
distribution. The gluon distribution can be determined indirectly at low x by mea- 
suring the scaling violations in the quark distributions, but a direct measurement is 
necessary at moderate to high x. About 40-50% of the momentum of the proton is 
carried by gluons, and most of that momentum is at relatively small x (16% of the 
momentum of the proton, for example, is carried by gluons in the x range from 0.01 
to 0.1.) The best direct information on the gluon distribution at moderate to high x 
comes from jet production at the TEVATRON and the LHC, although new processes 
(photon production, top pair production, etc.) will increasingly contribute. 

There has been a great deal of activity on the subject of PDF uncertainties. Two 
techniques in particular, the Lagrange Multiplier and Hessian techniques, have been 
used by CTEQ/CT, MSTW/MMHT, and HERAPDF to estimate PDF uncertain- 
ties [413, 753, 817, 819, 868]. The Lagrange Multiplier technique is useful for probing 
the PDF uncertainty of a given process, such as the Higgs boson cross-section, while 
the Hessian technique provides a more general framework for estimating the PDF un- 
certainty for any cross-section. In addition, the Hessian technique results in tools more 
accessible to the general user, such as error PDFs. Both techniques are described in the 
following. The Monte Carlo technique used by the NNPDF group for PDF uncertainty 
estimation will be described later in this section. 


6.3.1 The Hessian method 


The Hessian method for the determination of a central PDF and/or determination 
of PDF uncertainties involves minimizing a suitable log-likelihood function. The x? 
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Fig. 6.12 A comparison of the unshifted and shifted HERA1 combined 
neutral current data (left) and the comparison of the NLO CT10 predic- 
tions to the shifted data (right). Reprinted with permission from Ref. [713]. 


function may contain the full set of correlated errors, or only a partial set. The corre- 
lated systematic errors may be accounted for using a covariance matrix, or as a shift 
to the data, adopting a x? penalty proportional to the size of the shift divided by the 
systematic error. The two methods should be equivalent. Below we discuss the shift 
method. 

In the following description, CT10 NLO PDFs [713] are used (similar considerations 
apply at NNLO), along with the HERA Run I combined (H1+ZEUS) neutral current 
(etp) cross sections [77], to discuss the use of the Hessian formalism. A comparison of 
the HERA data and the NLO predictions using the CT10 PDFs is shown in Fig. 6.12. On 
the left, the data are presented in unshifted form, and on the right, optimal systematic 
error shifts have been applied in the manner detailed below. There is good agreement 
between the combined HERA Run I data and the CT10 NLO predictions with a global 
x? of about 680 for the 579 data points, which is typical for the global fit PDFs. 

The HERA Run I combined data have N, = 114 independent sources of experimen- 
tal systematic uncertainty, with parameters A, that should obey a standard Gaussian 
or normal distribution. The contribution of the HERA dataset to the x? can be written 


as 
N 


Ny 2 Ny 
P((a}, = 04 Q Ee Shasta) +55, 654) 


k=1 “k 


where N is the total number of points and T;,(a) is the theory value for the kth data 
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Fig. 6.13 Distribution of systematic parameters Aq of the combined 
HERA Run I data set [871] in the CT10 best fit (CT10.00). Reprinted 
with permission from Ref. [713]. 


point, dependent on the PDF parameters a. Furthermore, sẹ is the total uncorre- 
lated error on the measurement Dk that is obtained by summing the statistical and 
uncorrelated systematic errors on D, in quadrature, 


Sk = 2 stat + Si. uncorr sys (San) 


The y? function is minimized with respect to the size of the systematic error shifts Àa 
using the algebraic procedure described. It is also possible to add a penalty term to 
the x? function that prevents relatively unconstrained PDF parameters from reaching 
values that might lead to unphysical predictions in regions where experimental data 
are sparse (e.g. very low 2). 

As expected, a better agreement of data with theory is observed when the sys- 
tematic error shifts are allowed. It is important to check that the systematic error 
parameters Àa(a) contribute to the x? an amount on the order of the total number 
of systematic errors (114) and (b) that the sizes of the parameters follow a Gaus- 
sian distribution. For the case of CT10 and the HERA Run I data, the systematic 
error contribution to the total x? is 65, or somewhat better than expected, and their 
distribution is approximately Gaussian-distributed, as shown in Fig. 6.13. 

All systematic errors are not equally important, though, and it is also crucial to 
verify that no ’major’ systematic error needs to be shifted by several sigma. Given the 
precision of the HERA Run I data, the size of the systematic error shifts required are 
relatively small. This need not be the case, for example, for the case of inclusive jet 
production, either at the TEVATRON or the LHC. 

The Hessian method results in the production of a central (best fit) PDF, and 
a set of error PDFs. In this method, a large matrix (26 x 26 for CT10 and 28 x 28 
for CT14), with dimension equal to the number of free parameters in the fit, has to 
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Fig. 6.14 A schematic representation of the transformation from the PDF 
parameter basis to the orthonormal eigenvector basis. Reprinted with per- 
mission from Ref. [818]. 


be diagonalized.® The result is 26 (28) orthonormal eigenvector directions for CT10 
(CT14) which provide the basis for the determination of the PDF error for any cross- 
section. 

This process is shown schematically in Fig. 6.14. The eigenvectors are now admix- 
tures of the PDF parameters left free in the global fit. There is a broad range for the 
eigenvalues, over a factor of one million. The eigenvalues are distributed roughly lin- 
early as log e;, where €; is the eigenvalue for the i-th direction. The larger eigenvalues 
correspond to directions which are well-determined; for example, eigenvectors 1 and 
2 are sensitive primarily to the valence quark distributions at moderate x, a region 
where they are well-constrained. The theoretical uncertainty on the determination of 
the W mass at the TEVATRON depends primarily on these 2 eigenvector directions, 
as W production at the TEVATRON proceeds primarily through collisions of valence 
quarks. The most significant eigenvector directions for determination of the W mass 
at the LHC correspond to larger eigenvector numbers, which are primarily determined 
by sea quark distributions. In most cases, the eigenvector can not be directly tied to 
the behaviour of a particular PDF in a specific kinematic region. There are exceptions, 
such as eigenvector 15 in the CTEQ6.1 fit, discussed in the following. 

In the past, one of the most controversial aspect of PDF uncertainties has been 
the determination of the Ax? excursion from the central fit that is representative 
of a reasonable error. Nominally, a Ay? = T?(tolerance) would correspond to a 
1 — 0(68%CL) error. PDF fits performed with a limited number of experiments may 
be able to maintain that criterion. For example, HERAPDF uses a x? excursion of 1 
for a lo error.’ For general global fits, such as from CT and MMHT, however, a x? 


6 As more data is included, more PDF parameters in the global fit can be set free, resulting in a 
larger number of eigenvectors. 

“But the total error also includes other sources of uncertainty, for example from possible parame- 
terization bias. 
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excursion of 1 (for a lo error) is too low of a value in a global PDF fit. These global fits 
use data sets arising from a number of different processes and different experiments; 
there is a non-negligible tension between some of the different data sets. In addition, 
the finite number of PDF parameters used in the parameterizations (parameterization 
bias) also leads to the need for a larger tolerance. Thus, a larger variation in Ax? is 
required for a 68% CL. For example, CT10 uses a tolerance T=10 for a 90% CL error, 
corresponding to T=6.1 for a 68% CL error,® while MSTW uses a dynamical tolerance 
(varying from 1 to 6.5) for each eigenvector. 

The uncertainties for all predictions should be linearly dependent on the tolerance 
parameter used; thus, it should be reasonable to scale the uncertainty for an observable 
from the 90% CL limit provided by the CT error PDFs to a one-sigma error by dividing 
by a factor of 1.645. Such a scaling will be a better approximation for observables 
more dependent on the lower number eigenvectors, where the x? function is closer to 
a quadratic form. 

Even though, the data sets and definitions of tolerance are different among the 
different PDF groups, we will see later in this chapter that the PDF uncertainties at 
the LHC are fairly similar. Note that relying on the errors determined from a single PDF 
group may be an underestimate of the true PDF uncertainty, as the central results 
among the PDF groups can in some cases differ by an amount similar to this one-sigma 
error. (See the discussion later in this chapter regarding benchmarking comparisons of 
predictions and uncertainties for the LHC.) 

Each error PDF results from an excursion along the “+” and “—” directions for each 
eigenvector. Consider a variable X; its value using the central PDF for an error set (say 
CT14) is given by Xo. X t is the value of that variable using the PDF corresponding 
to the “+” direction for eigenvector i and X; the value for the variable using the 
PDF corresponding to the “—” direction. The excursions are symmetric for the larger 
eigenvalues, but may be asymmetric for the more poorly determined directions. In 
order to calculate the PDF error for an observable, a Master Equation should be 
used: 


N 
AX faa = \ X [mazr(X} — Xo, X7 — Xo,0)]? 
w=1 
N 
AX 7a = \ S [maz(Xo — X}, Xo — X;,0)]?. (6.56) 
t=1 


AX* adds in quadrature the PDF error contributions that lead to an increase in 
the observable X and AX- the PDF error contributions that lead to a decrease. The 
addition in quadrature is justified by the eigenvectors forming an orthonormal basis. 
The sum is over all N eigenvector directions, or 20 in the case of CTEQ6.1 and 26 
(28) in the case of CT10 (CT14). Ordinarily, Xt — Xo will be positive and X7 — Xo 


8A penalty is applied in the CT10 approach, if a disproportionate fraction of the increase in x? 
along a particular eigenvector direction is concentrated in one, or a few, experiments; this results in 
the tolerance being effectively smaller for those directions. 
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will be negative (or vice versa), and thus it is trivial as to which term is to be included 
in each quadratic sum. For the higher number eigenvectors, however, the “+” and 
“_” contributions may be in the same direction (see for example eigenvector 17 in 
Fig. 6.15). In this case, only the most positive term will be included in the calculation 
of AX* and the most negative in the calculation of AX. Thus, there may be less than 
N terms for either the “+” or “—” directions. There are other versions of the Master 
Equation in current use but the version listed above is the “official” recommendation 
of the authors. 

There are two things that can happen when new PDFs (eigenvector directions) are 
added: a new direction in parameter space can be opened to which some cross-sections 
will be sensitive to (an example of this is eigenvector 15 in the CTEQ6.1 error PDF 
set, which is sensitive to the high x gluon behaviour and thus influences the high 
pr jet cross-section at the TEVATRON and LHC). This particular eigenvector direction 
happens to be dominated by a parameter which affects mostly the large x behaviour 
of the gluon distribution. 

In this case, a smaller parameter space is an underestimate of the true PDF error 
since it did not sample a direction important for some physics. In the second case, 
adding new eigenvectors does not appreciably open new parameter space and the new 
parameters should not contribute much PDF error to most physics processes (although 
the error may be redistributed somewhat among the new and old eigenvectors). 

In Fig. 6.15, the PDF errors are shown in the “+” and “—” directions for the 
20 CTEQ eigenvector directions for predictions for inclusive jet production at the 
TEVATRON from the CTEQ6.1 PDFs. The excursions are symmetric for the first 10 
eigenvectors but can be asymmetric for the last 10, as they correspond to less well- 
determined directions. 

Either Xo and X; can be calculated separately in a matrix element /Monte Carlo 
program (requiring the program to be run 2N +1 times) or Xo can be calculated with 
the program and at the same time the ratio of the PDF luminosities (the product of 
the two PDFs at the x values used in the generation of the event) for eigenvector i 
+) to that of the central fit can be calculated and stored. This results in an effective 
sample with 2N +1 weights, but identical kinematics, requiring a substantially reduced 
amount of time to generate. PDF re-weighting will be discussed later in this chapter. 

As an example of PDF uncertainties using the Hessian method, the CT10 and 
MSTW 2008 NLO uncertainties for the up quark and gluon distributions are shown in 
Figs. 6.16 and 6.17. While the CT10 and MSTW2008 PDF distributions and uncer- 
tainties are reasonably close to each other, some differences are evident, especially at 
low and high z. 

After the initial diagonalization of the Hessian matrix, it is also possible to diago- 
nalize any one chosen function of the fitting parameters while maintaining a diagonal 
form for the x? function [816]. This additional function could be a particular cross- 
section, say for Higgs boson production through gg fusion. It may be that such an 
observable may be dominated by a few eigenvector directions, something which will 
be illuminated by the additional diagonalization. It is also possible to determine the 
particular direction in eigenvector space that has the greatest sensitivity to a partic- 
ular observable, i.e. the steepest gradient. This will become important when looking 


—— 
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Fig. 6.15 The PDF errors for the CDF inclusive jet cross-section in Run I 

for the 20 different eigenvector directions contained in the CTEQ6.1 PDF 
error set. The vertical axes show the fractional deviation from the central 
prediction and the horizontal axes the jet transverse momentum in GeV. 
Reprinted with permission from Ref. [867]. 


for PDF correlations among cross-sections, and in the discussion of meta-PDFs. 


6.3.2 The Lagrange Multiplier method 


Another technique for determining the uncertainty on a physical observable is the 
Lagrange Multiplier (LM) method [868]. The LM method can be considered as an 
extension of the x? minimization procedure that relates the uncertainty of the physical 
observable (which depends on the PDFs) to the variation of the x? function used in 
the global fitting. A Lagrange Multiplier variable is introduced and the function 
WO, a) = Xetobai(@) +AX(a) is minimized for different values of the parameter À. Here, 
a refers to the original set of (d) parameters determined in the PDF fit, and X (a) is the 
physical observable dependent upon those PDF parameters. The Lagrange Multiplier 
method provides optimal PDFs tailored for a specific study. A representation of the 
LM method is shown in Fig. 6.18 where a hypothetical mapping of the set of d PDF 
parameters corresponding to various values of the LM parameter À are mapped onto 
the global x? plotted as a function of the possible values of the physical observable X. 


An example Lagrange Multiplier analysis for the production of a Higgs boson 
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Fig. 6.16 A comparison of the CT10 and MSTW2008 up quark PDF 
uncertainty bands at Q? = 10* GeV?. The NNPDF2.3 central up PDF is 
also shown for comparison. 


through gg fusion at 8 and 13 TeV using the CT14 PDFs is shown in Fig. 6.19. The 
parabolic curve has been determined using the Lagrange Multiplier method, while the 
points indicate the results of the Hessian analysis for 90% CL (Ax? = 100). (The 
dashed curve shows the results of applying the Tier-2 penalty (a penalty that prevents 
the agreement with any particular experiment from degrading too greatly) when the 
x? of any experiment starts seriously degrading. 

Table 6.1 is reproduced below from Ref. [489], showing both the PDF and the 
PDF-+a,(mz) uncertainties determined from the Hessian and the Lagrange Multiplier 
methods for Higgs boson production through gg fusion at NNLO. The results indicate 
both the agreement between the Hessian and Lagrange Multiplier techniques and the 
efficacy of scaling the 90%CL Hessian uncertainty by a factor of 1.645 to get the 68% 
CL uncertainty. 


6.3.3 The NNPDF approach 


For predictions using NNPDF PDFs, a Monte Carlo sample of PDFs is given, such 
that the expectation value of any observable F'[q] depending on the PDFs is calculated 
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Fig. 6.17 A comparison of the CT10 and MSTW2008 gluon PDF uncer- 
tainty bands at Q? = 10* GeV’. The NNPDF2.3 central gluon PDF is also 


shown for comparison. 


Table 6.1 Uncertainties of c#(gg —> H) at NNLO computed by the LM method and by 
the Hessian method, with the Tier-2 penalty included. The 68% C.L. errors are given as 
percentage of the central value, and the PDF-only uncertainties are for a, = 0.118. 


gg — H (pb), PDF unc., ag = 0.118 8 TeV 13 TeV 
68% C.L. (Hessian) 18.7 + 2.1% — 2.3% | 42.7 + 2.0% — 2.4% 
68% C.L. (LM) 42.3% — 2.3% +2.4% — 2.5% 
gg + H (pb), PDF+a, 8 TeV 13 TeV 
68% C.L. (Hessian) 18.7 + 2.9% —3.0% | 42.7+3.0% — 3.2% 
68.0% CL (LM) +3.0% — 2.9% $3.2% — 3.1% 


over an ensemble of PDF replicas using the formula 


(Fl{a}]) = 


Nrep 


yo Fie), 


k= 


1 
Nrep 


pan 


(6.57) 
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Ay? 


PDF uncertainties 433 


2-dim (i,j) rendering a: X: physics 
of d-dim PDF á variable 
parameter space 


Lx : 


contours of X global 


MC sampling - 


xX LM method @¢ 


Xg-AX Xo AptAX 


Fig. 6.18 On the left is an illustration of how the Lagrange Multiplier 
Method provides sample points along a curve Lx in the multi-dimensional 
parameter space, where X is the observable of interest. On the right is 
an illustration of how these sample points are mapped onto a global x? 
distribution, plotted as a function of the value of the cross-section of the 
observable x. Reprinted with permission from Ref. [868]. 
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Fig. 6.19 A calculation of the x° distribution vs the NNLO Higgs bo- 
son cross-section in the CT14 global analysis at 8 TeV (left) and 13 TeV 
(right). The Lagrange Multiplier (curve) and Hessian approaches (dots) 
are compared. Reprinted with permission from Ref. [490]. 


where Nrep is the number of replicas of PDFs in the Monte Carlo ensemble. The 
uncertainty for any observable is calculated as the standard deviation of the sample. 


1/2 
or = (NOEs (FILA) - FH) 
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1/2 
Nrep / 


D (Fite - FD) (6.58) 


Nrep bat 


This equation provides the 1-sigma error on any observable; one advantage of the 
Monte Carlo approach is that any confidence-level can be calculated by removing the 
appropriate upper and lower PDF outliers. The NNPDF collaboration provides sets of 
Nrep=100 and 1000 replicas. For most applications, the smaller replica set is sufficient. 
A central set corresponding to the average of the replicas 


===> Yi. (6.59) 


is also provided[190-192, 194]. 


6.3.4 Meta-PDFs 


It is possible to re-fit and to re-parameterize, in a common functional form, the error 
PDFs from a number of PDF fitting groups. The result is an ensemble of PDF's that 
encompasses the uncertainty of all of the PDF error sets included. The ensemble can 
also be expanded to cover the combined PDF+a, uncertainties, instead of just the 
PDF uncertainties alone. The ensemble can be transformed into a Hessian basis, and 
then only a limited number of the (important) eigenvectors can be retained, leading to 
a smaller ensemble that provides PDFs corresponding to both the central behaviour 
of the group of PDFs (for example, CT14, MMHT14, and NNPDF3.0) and to the full 
uncertainty range. Such PDFs are known as meta-PDFs [552] and they can make it 
easier to calculate PDF(+a;) uncertainties for any observable at the LHC. In addition, 
by using the technique of data set diagonalization, the number of error PDFs needed 
to describe the PDF + as uncertainties for all Higgs production processes for all LHC 
energies can be reduced to 8. 


6.3.5 PDF uncertainties and evolution 


Evolution is the great equalizer. Parton uncertainties tend to decrease as the factoriza- 
tion scale increases. This can be seen for example for the case of the gluon distribution 
in Figs. 6.20, 6.21 6.22 and 6.23 where the gluon uncertainties for the CT10 and 
MSTW2008 NLO PDFs are shown for Q? values of 2, 10, 100, and 10000 GeV”. Aside 
from the high x region, a significant decrease in the uncertainty is observed. 


6.3.6 PDF uncertainties and Sudakov form factors 


As discussed in the above section, it is often useful to use the error PDF sets with par- 
ton shower Monte Carlos. The caveat still remains that a true test of the acceptances 
would use a NLO MC. Similar to their use with matrix element calculations, events 
can be generated once using the central PDF and the PDF weights stored for the error 
PDFs. These PDF weights then can be used to construct the PDF uncertainty for any 
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Fig. 6.20 A comparison of the uncertainties of the CT10 and MSTW2008 
NLO gluon distributions, along with the central PDF of NNPDF2.3, for a 
Q? value of 2 GeV?. 


observable. One additional complication with respect to their use in matrix element 
programmes is that the parton distributions are used to construct the initial state par- 
ton showers through the backward evolution process. The space-like evolution of the 
initial state partons is guided by the ratio of parton distribution functions at different 
x and Q? values. Thus the Sudakov form factors in parton shower Monte Carlos will be 
constructed using only the central PDF and not with any of the individual error PDFs 
and this may lead to some errors for the calculation of the PDF uncertainties of some 
observables. However, it was demonstrated in Ref. [578] that the PDF uncertainty for 
Sudakov form factors in the kinematic region relevant for the LHC is minimal, and the 
weighting technique can be used just as well with parton shower Monte Carlos as with 
matrix element programmes. 


6.3.7 Choice of as(mz) and related uncertainties 


Global PDF fits are sensitive to the value of the strong coupling constant a;, explicitly 
through the QCD cross-sections used in the fits, and implicitly through the scaling 
violations observed in DIS. In fact, a global fit can be used to determine the value 
of as(mz), albeit less accurately than provided by the world average. Historically, 
some PDF groups have used the world average value of ag(mz) [239, 240, 791] as 
a fixed constant in the global fits, while other groups have allowed a,(mz) to be a 
free parameter in the fit. It is also possible to explore the effects of the variation of 
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Fig. 6.21 A comparison of the uncertainties of the CT10 and MSTW2008 
NLO gluon distributions, along with the central PDF of NNPDF2.3, for a 
Q? value of 10 GeV?. 


as(mz) by producing PDFs at different fixed as(mz) values. There is now a consensus 
to use as(mz) = 0.118 as a central value (basically an approximation/truncation of 
the current world value) in global PDF fits,at both NLO and NNLO, and to publish 
alternative fits with as(mz) values in intervals of +0.001 around that central value. 
It is expected that the LO value of as(mz) is considerably larger than the NLO value 
(0.130 compared to 0.118 for the CTEQ/CT PDFs, for example). 

There is a correlation/anti-correlation between the value of as(mz) used in the 
global PDF fit and the gluon distribution; whether there is a correlation or anti- 
correlation depends on the gluon x range being considered. At low x (less than 0.1), 
a decrease in the value of as(mz) results in an increase in the gluon distribution and 
vice versa, i.e. there is an anti-correlation. The net impact is to reduce the sensitivity 
of cross-sections that depend on both the value of as(mz) and the gluon distribution 
in this x range to variations in the value of a,(mz). The sensitivity becomes smaller 
as the x value approaches 0.1. In the x range from 0.1 to 0.8, there is a correlation 
between the value of as(mz) and the gluon distribution, with the correlation becoming 
larger as the x value increases. 

The diagonalization technique can also be used with respect to the value of as(mz); 
in fact, it can be shown, using this technique, that, within the quadratic approximation, 
the uncertainty in a;(mz) is uncorrelated with the PDF uncertainty [714]. Thus the 
combined PDF+as can be calculated by computing the 1 — o PDF uncertainty with 
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Fig. 6.22 A comparison of the uncertainties of the CT10 and MSTW2008 
NLO gluon distributions, along with the central PDF of NNPDF2.3, for a 
Q? value of 100 GeV?. 


Qs(mz) fixed at its central value, and adding in quadrature the 1 — ø uncertainty in 
as(mz) (and of course, this can also be done for any other desired confidence level). 


6.3.8 PDF correlations 


The uncertainty analysis may be extended to define a correlation between the uncer- 
tainties of two variables, say X(@) and Y(@). As for the case of PDFs, the physical 
concept of PDF correlations can be determined both from PDF determinations based 
on the Hessian approach and on the Monte Carlo approach. 


6.3.8.1 PDF correlations in the Hessian approach 


Consider the projection of the tolerance hypersphere onto a circle of radius 1 in the 
plane of the gradients VX and VY in the parton parameter space (776, 817]. The 
circle maps onto an ellipse in the XY plane. This “tolerance ellipse” is described by 
Lissajous-style parametric equations, 


X = Xo + AX cos9, (6.60) 
Y = Yo + AY cos(0 + vy), (6.61) 
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Fig. 6.23 A comparison of the uncertainties of the CT10 and MSTW2008 
NLO gluon distributions, along with the central PDF of NNPDF2.3, for a 
Q? value of 10000 GeV?. 


where the parameter 0 varies between 0 and 27, Xo = X (Go), and Yo = Y (ão). AX and 
AY are the maximal variations 0X = X — Xo and ôY = Y — Yo evaluated according 
to the Master Equation, and ¢ is the angle between VX and VY in the {a;} space, 
with 


N 


_VX-VY¥_ 1 GO. ge NO eS 
Sones SRY, axa l (X =a Ie Sk ). (6.62) 


i= 


The quantity cos y characterizes whether the PDF degrees of freedom of X and Y 
are correlated (cosy œ% 1), anti-correlated (cosy ~ —1), or uncorrelated (cosy ~ 0). 
If units for X and Y are rescaled so that AX = AY (e.g, AX = AY = 1), the 
semimajor axis of the tolerance ellipse is directed at an angle 7/4 (or 37/4) with 
respect to the AX axis for cosy > 0 (or cosy < 0). In these units, the ellipse reduces 
to a line for cosy = +1 and becomes a circle for cos y = 0, as illustrated by Fig. 6.24. 
These properties can be found by diagonalizing the equation for the correlation ellipse. 
Its semi-minor and semi-major axes (normalized to AX = AY) are 
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Fig. 6.24 Correlations ellipses for a strong correlation (left), no correla- 
tion (centre) and a strong anti-correlation(right). Reprinted with permis- 
sion from Ref. [774]. 


sin yp 
minor, 4major sf = : 6.63 
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The eccentricity € = J 1 — (Gminor/Qmajor)? is therefore approximately equal to \/|cos ¢| 
as |cosy| > 1. 
The ellipse itself is described by 


XN? (YN? 5X \ / 6Y 
(=) (=) 2 (=) (=) cosy = sin? y. (6.64) 


A magnitude of | cosy] close to unity suggests that a precise measurement of X 
(constraining dX to be along the dashed line in Fig. 6.24) is likely to constrain tangibly 
the uncertainty Y in Y, as the value of Y shall lie within the needle-shaped error 
ellipse. Conversely, cosy ~ 0 implies that the measurement of X is not likely to 
constrain 6Y strongly.’ 

The values of AX, AY, and cos y are also sufficient to estimate the PDF uncertainty 
of any function f(X,Y) of X and Y by relating the gradient of f(X,Y) to Oxf = 
Of /OX and dy f = Of /OY via the chain rule: 


Af = [7z = (AX dxf )?+2AX AY cosy Oxf ðyf+(AY Of). (6.65) 


Of particular interest is the case of a rational function f(X,Y) = X™/Y™, pertinent to 
computations of various cross-section ratios, cross-section asymmetries, and statistical 
significance for finding signal events over background processes [776]. For rational 
functions Eq. (6.65) takes the form 


A 
< 


The allowed range of 6Y/AY for a given ô = 6X/AX is r < ôY/AY < ro, where rG 
cosy + v1 -— ô? sin ọ. 
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Fig. 6.25 Contour plots of the correlation cosine between two PDFs, for 
the up quark (left) and the gluon (right). 


Af ( AX ) AX AY ( AY i 
a m 2mn cos Hin : 6.66 
fo y Xo y Yo (6-66) 


For example, consider a simple ratio, f = X/Y. Then Af/fo is suppressed (A f/ fo 
|[AX/Xo — AY/Yo|) if X and Y are strongly correlated, and it is enhanced (Af /fo 
AX/Xo + AY/Yo) if X and Y are strongly anticorrelated. 

As would be true for any estimate provided by the Hessian method, the correlation 
angle is inherently approximate. Eq. (6.62) is derived under a number of simplifying 
assumptions, notably in the quadratic approximation for the x? function within the 
tolerance hypersphere, and by using a symmetric finite-difference formula for {0;X} 
that may fail if X is not monotonic. Even with these limitations in mind, the correlation 
angle is a convenient measure of the interdependence between quantities of diverse 
nature, such as physical cross-sections and parton distributions themselves. 

Correlations can be calculated between two PDFs, fai(a1, 1) and fa2(X2, u2) at 
a scale “Wy = u2 =85 GeV. In the figure below,the self-correlations for the up quark 
(left) and the gluon (right) are shown. Light (dark) shades of grey correspond to coso 
close to 1 (-1). Each self-correlation includes a trivial correlation (cos = 1) when zı 
and x2 are approximately the same (along the x; = x2 diagonals). For the up quark, 
this trivial correlation is the only pattern present. The gluon distribution, however, 
also shows a strong anti-correlation when one of the x values is large and the other 
small. This arises as a consequence of the momentum sum rule. 

PDF correlations for physics processes at the LHC will be discussed later in this 
chapter. 


LR 2 
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6.3.8.2 Correlations within the Monte Carlo approach 


Correlations can also be calculated using the Monte Carlo approach, as practiced for 
example by NNPDF [135]. The correlation cosine between two observables A and B 
can be calculated in this approach as 


Nrep _ (AB)rep — (A) rep(B) rep 
(Nrep ~ 1) CAOB 


cosġ|A, B] = (6.67) 

where the averages are taken over the ensemble of the n;ep values of the observables 
computed with the different replicas of the NNPDF set, and o,4,p are the standard 
deviations of the ensembles. 


6.4 Resulting parton distribution functions 
6.4.1 LO, NLO and NNLO PDFs 


Global PDF fitting groups have also traditionally produced sets of PDFs in which 
leading order rather than next-to-leading order (or next-to-next-to-leading order) ma- 
trix elements, along with the 1-loop ag rather than the 2-loop ag, have been used to 
fit the input datasets. The resultant leading order PDFs have most often been used in 
conjunction with leading order matrix element programmes or parton shower Monte 
Carlos. However, the leading order PDFs of a given set will tend to differ from the 
central PDFs in the NLO fit, and in fact will most often lie outside the PDF error 
band. Such is the case for the up quark distribution and the gluon distribution from 
the CTEQ6.1 set of PDFs shown in Fig. 6.26 where the LO PDFs are plotted along 
with the NLO PDF error bands.!° The LO up quark distribution is considerably larger 
than its NLO counterpart at both small x and large x. This is due to (1) the larger 
gluon distribution at small x for the LO PDF and (2) the influence of missing log(1— x) 
terms in the LO DIS matrix element. The gluon distribution is outside of the NLO 
error band basically for all x. It is higher than the NLO gluon distribution at small x 
due to missing log(1/x) terms in the LO DIS matrix element. It is smaller than the 
NLO gluon distribution at large x basically due to the momentum sum rule and the 
lack of constraints at high zx. 

The global PDF fits are dominated by the high statistics, low systematic error 
deep inelastic scattering data, and the differences between the LO and NLO PDFs are 
determined most often by the differences between the LO and NLO matrix elements 
for deep inelastic scattering. This is especially true at low x and at high x, due to 
missing terms that first arise in the hard matrix elements for DIS at NLO. As the 
NLO corrections for most processes of interest at the LHC are reasonably small, the 
use of NLO PDFs in conjunction with LO matrix elements will most often give a closer 
approximation of the full NLO result (although the result remains formally LO). In 
many cases in which a relatively large K-factor results from a calculation of collider 
processes, the primary cause is the difference between LO and NLO PDFs, rather than 
the differences between LO and NLO matrix elements. 


10These observations are true in general for comparison of any sets of LO and NLO PDFs. 
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Fig. 6.26 The CTEQ6L1 up quark and gluon PDFs, evaluated at 
Q? = 10* GeV? compared to the CT10 NLO PDF error bands for the 


same. 


In most cases, LO PDFs will be used not in fixed order calculations, but in pro- 
grammes where the LO matrix elements have been embedded in a parton shower 
framework. In the initial state radiation algorithms in these frameworks, shower pa- 
trons are emitted at non-zero angles with finite transverse momentum, and not with a 
zero kr implicit in the collinear approximation. It might be argued that the resulting 
kinematic suppression due to parton showering should be taken into account when 
deriving PDFs for explicit use in Monte Carlo programmes. Indeed, there is substan- 
tial kinematic suppression for production of a low-mass (10 GeV) object at forward 
rapidities due to this effect, but the suppression becomes minimal once the mass rises 
to the order of 100 GeV [715]. 


6.4.2 Modified LO PDFs 


Due to the inherent differences between LO and NLO PDFs, and the relatively small 
differences between LO and NLO matrix elements for processes of interest at the LHC, 
LO calculations at the LHC using LO PDFs often lead to erroneous predictions. This 
is true not only of the normalization of the cross-sections, but also for the kinematic 
shapes. This can be seen for example in the predictions for the W+/ —/Z and Higgs 
rapidity distributions seen in Fig. 6.27, where the wrong shapes for the vector boson 
rapidity distributions result from the deficiencies of the LO DIS matrix elements used 
in the fit. This can have an impact, for example, if the LO predictions are used to 
calculate final-state acceptances. 

In an attempt to reduce the size of the errors obtained using LO PDFs with LO 
predictions, modified LO PDFs have been produced. The techniques used to produce 
these modified PDFs include (1) relaxing the momentum sum rule in the global fit 
and (2) using NLO pseudo-data in order to try to steer the fit towards the desired 
NLO behaviour. Both the CTEQ [715] and MRST [846] modified LO PDFs use the 
first technique, while the CTEQ PDFs use the second technique as well. 

Of course, the desired behaviour can also be obtained (in most cases) by the use of 
NLO PDFs in the LO calculation. Here, care must be taken that only positive-definite 
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Fig. 6.27 A comparison of NLO predictions for SM boson rapidity dis- 
tributions to LO predictions for the same, using CTEQ6.6 and CTEQ6L1 
PDFs respectively. Reprinted with permission from Ref. [715]. 


NLO PDFs be used. Increasingly, most processes of interest have been included in 
NLO parton shower Monte Carlos. Here, the issue of LO PDFs becomes moot, as 
NLO PDFs must be used in such programs for consistency with the matrix elements. 
As a result, the use of modified LO PDFs has been decreasing. 


6.4.3 NNLO PDFs, and beyond 


All of the PDF groups now have PDFs determined at NNLO as well as at NLO. The 
transition from NLO to NNLO results in much smaller changes to the PDFs than 
for the transition from LO to NLO. Although the changes from NLO to NNLO are 
much smaller than those from LO to NLO, they can still be observed, especially at 
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Fig. 6.28 CT14 NNLO PDFs as a function of x for Q = 2 GeV (left) and 
Q =100 GeV (right). Reprinted with permission from Ref. [489]. 


low Q values.!! At higher Q values, though, the differences are reduced. As the PDF 
uncertainties are dominated by the experimental errors of the data included in the 
PDF fits, the uncertainties at NNLO will be similar to those determined at NLO. 

As mentioned previously, the most recent PDF set from the CTEQ-TEA group is 
CT14 [489]. The NNLO PDFs from CT14 are shown in Fig. 6.28 for a Q values of 2 
GeV and 100 GeV. 

Differences between the CT14 and the CT10 PDFs for the up quark and the gluon 
distributions are shown in Fig. 6.29. The differences are relatively small, within the 
error bands of either PDF set, and tend to be the most significant at low x and high 
x where the PDFs are the most unconstrained. One of the most important changes is 
not easily visible in these plots; that is of the gluon distribution in the x region around 
0.01. The changes from CT10 to CT14 are small, but have an impact on the PDF 
uncertainty for Higgs boson production at the LHC, as discussed in Section 6.5.1. 

As described in Chapter 2, the gg fusion cross-section for Higgs production to 
NNNLO has been completed. Since it will be quite some time before any PDFs are 
produced at this order, there arises the question as to the level of error produced when 
NNLO PDFs are used with such NNNLO calculations. It has been shown [531] that 
the error should be much smaller than the level of the difference between the NLO 
and the NNLO matrix elements. 


6.5 CT14 and parton luminosities 
It is useful to introduce the idea of differential parton-parton luminosities. Such lu- 


minosities, when multiplied by the dimensionless cross-section ŝô for a given process, 


11For example, at low Q, the order a2 evolution in the NNLO PDF suppresses g(x, Q) and increases 
q(x, Q) relative to NLO. 
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Fig. 6.29 A comparison of the CT10 and CT14 up quark (left) and gluon 
(right) distributions. Reprinted with permission from Ref. [489]. 


provide a useful estimate of the size of an event cross-section at the LHC. Below we 
define the differential parton-parton luminosity dL;;/d8 dy and its integral dL;,;/dé: 


dli; 1 1 
di dy eae [fi(r1, u) fj (v2, 4) + (1 © 2)]. (6.68) 


The prefactor with the Kronecker delta avoids double-counting in case the partons are 
identical. The generic parton-model formula 


1 
c= >/ dz dza fi(t1, p) fj (G2, U) Fig (6.69) 
ij 


can then be written as!” 


=<) ($ av) (=) (3 ôij) . (6.70) 


Fig. 6.30 shows a plot of the luminosity function integrated over rapidity, dL;j/d8 = 
J (dL,;/d8 dy) dy, at the LHC ys = 13 TeV for various parton flavour combinations, 
for the CT14 PDFs. The gluon-gluon PDF luminosity dominates at low mass, the 
gluon-quark PDF luminosity for masses from 300 GeV to approximately 2 TeV, with 
the quark-quark luminosity being largest for all masses above 2 TeV. 


6.5.1 Comparison of PDFs at the LHC 


As mentioned earlier in this chapter, the three major PDF fitting groups (with the 
latest PDF sets being CT14, MMHT2014, and NNPDF3.0) fit to basically the same 
data sets (albeit with different kinematic cuts in some cases). However, there can 
still be differences in the resultant PDF fits and uncertainties due to differences in 
the fitting procedures and to details such as the parameterizations and heavy quark 


12Note that this result is easily derived by defining T = 21 22 = 8/s and observing that the Jacobian 
a(r, y)/O(a1, £2) = 1. 
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Fig. 6.30 The parton-parton luminosities for CT14 for pp collisions at 13 
TeV plotted as a function of mass (Mx = V3). 


flavour schemes used, etc. However, as time has progressed, the tendency has been for 
the three groups to have results that are in good agreement with each other. 
Consider for example the quark-antiquark and gluon-gluon PDF luminosity uncer- 
tainties as a function of mass, at NNLO, for CT14, MMHT2014, and NNPDF3.0 (and 
the prior, to be defined in the following) for an LHC centre-of-mass energy of 13 TeV, 
shown in Fig. 6.31 from Ref. [196]. The central values and the size of the uncertainties 
can vary among the 3 PDF groups at low-mass and at high-mass, but in the precision 
mass region (say from 50-500 GeV), both the central values and the uncertainties are 
in remarkably good agreement with each other (especially so for the gluon-gluon case). 
This was not the situation for the gluon-gluon luminosity for the previous round of 
PDF fitting (CT10, MSTW08 and NNPDF2.3), where the envelope of the uncertainty 
bands for the 3 PDF groups yielded a PDF uncertainty for Higgs boson production 
through gg fusion that was about a factor of 2.5 larger than the PDF uncertainty 
band for any of the individual PDFs [196].'° The resultant PDF uncertainty was sim- 
ilar in size to the NNLO scale uncertainty for the cross-section [618]. As the scale 
uncertainty at NNNLO has shrunk to the order of 2-3% [156], the use of the older 
generation of PDFs would have left the PDF(+as,(mz)) uncertainty as the largest 
source of uncertainty for that cross-section, and implicitly for the determination of the 
Higgs couplings and other parameters that depend on the absolute knowledge of the 


13The CT14 gg PDF luminosity increased by about 1% in the Higgs region at 13 GeV (compared 
to the older generation CT10), the MMHT2014 decreased by about 0.5%, and the NNPDF3.0 PDF 
luminosity decreased by about 2-2.5%. 
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Fig. 6.31 A comparison of the PDF luminosities for the prior, CT14, 
MMHT2014 and NNPDF3.0 are shown for the gg initial state (left) and 
the qq initial state (right) for a centre-of-mass energy of 13 TeV. Reprinted 
with permission from Ref. [295]. 


Standard Model cross-sections for Higgs boson production. 

The PDF4LHC working group has been responsible for the determination of appro- 
priate recommendations for both the PDFs and the PDF uncertainties for the LHC. 
Implicitly, this also includes recommendations for the value of a,(mz) and its un- 
certainties. Through its history, the group has performed a number of benchmarking 
exercises and recommendations [135, 196, 268],!4 with the most recent being [295], 
using the 3 global PDF fits discussed above. Prior to this most recent document, the 
recommendation for a,(mz) was to use a central value of 0.118, with an uncertainty 
(at 90% CL) of +0.002, corresponding to a 68% CL uncertainty of +0.0012. The cen- 
tral value was chosen as a truncation of the world average value of as(mz) (0.1185), 
and the uncertainty was an enlargement of the value obtained in the world average 
+£0.0006) [799]. All of the global PDF groups (as well as HERAPDF) use this central 
value for their central fits, as well as producing alternate PDF sets with the value of 
as(mz) typically varying in increments of +0.001. In the recent PDF4LHC recom- 
mendation,the central value of a;(mz) has remained the same (0.118), but the 68% 
CL uncertainty was changed to +0.0015, partially to reflect the increased level of un- 
certainty adopted by the Particle Data Group [791], and partially to reflect that the 
same value of a,(mz) is recommended for both NLO and NNLO. (There is a question 
whether the value should be the same for the two orders, or whether the NNLO value 
should be slightly smaller.) 

Previous PDF4LHC recommendations have determined the uncertainty for a given 
cross-section by using the envelope of the error bands of PDF luminosities from the 
3 global PDF groups. This has the drawback that it lacks a rigorous statistical in- 
terpretation (the envelope can be determined by a few extreme sets). In the new 
recommendation, the uncertainty is obtained from the combination of error PDFs 
from the 3 groups using Monte Carlo replicas [885]. For CT14 and MMHT2014, an 
arbitrary number of Monte Carlo replicas can be generated from the Hessian eigenvec- 
tors such that the replicas represent the underlying probability density. It is possible 


— 
| 


14See also http: //www.hep.ucl.ac.uk/pdf41hc/ 
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Fig. 6.32 A comparison of the PDF luminosities for the prior, the 30 PDF 
set and the 100 PDF set, is shown for the gg initial state (left) and the qq 
initial state (right) for a centre-of-mass energy of 13 TeV. Reprinted with 
permission from Ref. [295]. 


to convert from a Hessian representation to a Monte Carlo representation. The use of 
a Monte Carlo representation allows a straightforward combination of PDF sets and a 
more rigorous determination of the 68% CL.!° The new PDF4LHC recommendation 
is based on a 900 Monte Carlo replica set, 300 derived each from the eigenvectors 
of CT14, MMHT2014, and NNPDF3.0. The 900 PDF set can be reduced to either 
30 error PDFs or 100 error PDFs (the latter with either symmetric or asymmetric 
errors) using 3 separate techniques [332, 334, 552]. Each adequately represents (1) 
the central PDF and (2) the PDF uncertainties of the 900 Monte Carlo replica prior. 
Details on each of these techniques and their specific applications are given in the 
PDF4LHC recommendation document and in the forthcoming Higgs Yellow Report 
4. Each technique also includes 2 member sets with a,;(mz) 0.0015 higher or lower 
than the nominal value of 0.118. The PDF+a,(mz) error can be calculated by adding 
these sets in quadrature with the PDF error sets. 

The PDF luminosity uncertainties for the 900 PDF Monte Carlo replica set, along 
with those from the PDF4LHC15 error set containing 30 error PDFs and from the error 
set containing 100 error PDFs, are shown in Fig. 6.32, for the gg PDF luminosity (left) 
and the gg PDF luminosity (right). The 900 PDF Monte Carlo replica set represents 
the best estimate by the PDF4LHC group for the PDF uncertainty, but its absolute 
accuracy is unknown. Both the 30 and 100 PDF sets are capable of reproducing the 
uncertainty of the prior in the precision mass range, and in the high mass range; the 
100 PDF set better reproduces the uncertainty for very low mass.'® The resultant 
PDF uncertainty for gg fusion production of a Higgs boson at 13 TeV is 2%, and the 
a,(mz) uncertainty is 2%, both comparable to the NNNLO scale uncertainty. 

A comparison of the predictions for Higgs boson production at 13 TeV relative 
to the prediction derived from the prior, for gg fusion (left) and vector boson fusion 


15The NNPDF3.0 set is already in this formulation and the CT14 and MMHT2014 Hessian sets 
can be converted to such. 


16The low-mass difference is basically due to low-mass final states produced at rapidities beyond 
the acceptance of the LHC detectors, which were not included in the construction of the 30 PDF set. 
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Fig. 6.33 A comparison of the predictions for Higgs boson production 
through gg fusion (left) and vector boson fusion (right) for a centre-of-mass 
energy of 13 TeV. Reprinted with permission from Ref. [441]. 


Table 6.2 The correlation coefficients between various Higgs production cross-sections at 
13 TeV. In each case, the PDF4LHC15 NNLO prior set is compared to the Monte Carlo 
and with the two Hessian reduced sets, and the results from the three individual sets, CT14, 
MMHT14 and NNPDF3.0. 


correlation coefficient 
Pore tt, Htt | tt,hW | tt,hZ | ggh, htt | ggh,hW | ggh,hZ 
PDF4LHC15_nnlo_prior 0.87 -0.23 -0.34 -0.13 -0.01 -0.17 
PDF4LHC15_nnlo_mc 0.87 -0.27 -0.35 -0.10 0.07 -0.01 
PDF4LHC15-_nnlo_100 0.87 -0.24 -0.34 -0.13 -0.02 -0.17 
PDF4LHC15_nnlo_30 0.87 -0.27 -0.43 -0.13 -0.04 -0.23 
CT14 0.09 -0.32 -0.44 -0.26 -0.03 -0.18 
MMHT14 0.90 -0.22 -0.52 0.08 -0.18 -0.33 
NNPDF3.0 0.90 -0.17 -0.21 0.18 0.52 0.49 


(right), is shown in Fig. 6.33 [441]. From these plots, the level of agreement of the 
predictions from the PDF sets provided by the PDF4LHC group with each other, and 
with the current and previous generation of global PDF sets, can be determined. 

The correlation coefficients between various Higgs boson production processes at 
13 TeV are shown in Table 6.2 [295]. Note the spread in correlation coefficients among 
the three global PDFs. No more than a single digit accuracy should be ascribed to the 
correlation numbers. 

A small number of error PDFs can be very useful in instances where the PDF 
uncertainties are used as nuisance parameters. It is possible, for example, to reduce 
the number of error PDFs needed to describe Higgs physics and backgrounds down to a 
smaller number, on the order of 7, using the METAPDF technique, without significant 
loss of precision [552]. Other techniques are available which also reduce the number of 
error PDFs needed [333]. 
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6.6 LHAPDF and other tools 
6.6.0.1 LHAPDF, Durham PDF plotter, and APFEL 


Libraries such as PDFLIB [812] were established to maintain a large collection of 
available PDFs. However, PDFLIB is no longer supported, making it more difficult for 
easy access to the most up-to-date PDFs. In addition, the determination of the PDF 
uncertainty of any cross-section typically involves the use of a large number of PDF's 
(up to several hundred) and PDFLIB was not set up for easy accessibility for a large 
number of PDFs. 

At Les Houches in 2001, representatives from a number of PDF groups were present 
and an interface (Les Houches Accord 2, or LHAPDF) [572] that allows the compact 
storage of the information needed to define a PDF was defined. Each PDF can be 
determined either from a grid in x and Q?, or by a few lines of information — essentially 
the starting values of the parameters at Q = Qo. The interface then carries out the 
evolution to any x and Q value, at either LO, NLO, or NNLO, as appropriate for each 
PDF. 

The interface is as easy to use as PDFLIB and consists essentially of 3 subroutine 
calls: 


e call InitPDFset (name): called once at the beginning of the code; name is the 
file name of the external PDF file that defines the PDF set (for example, CTEQ, 
MSTW, or NNPDF). 

e call Initpdf (mem): mem specifies the individual member of the PDF set. 


e call evolvepdf(x,Q,f): returns the PDF momentum densities for flavour f at a 
momentum fraction x and scale Q. 


Responsibility for LHAPDF has been taken over by the Durham HEPDATA project [891] 
and regular updates and improvements have been produced. Interfaces with LHAPDF 
are now included in most matrix element programs. Recent modifications make it pos- 
sible to include all error PDFs in memory at the same time. Such a possibility reduces 
the amount of time needed for PDF error calculations on any observable, as discussed 
below. 

One very useful tool is the Durham PDF plotter!” which allows the fast plotting and 
comparisons of PDFs, including their error bands. Recently APFEL Web!’ [238, 331], 
a web-based application for the graphical visualization of PDFs was developed that 
allows a very fast online calculation of PDFs and their associated errors, as well as 
PDF luminosities. Many of the PDF plots in this book were made using these two 
routines. 


6.6.1 PDF event re-weighting, Applgrid, and fastNLO 


NLO and NNLO programmes are notoriously slow. Thus, it can be very time-consuming 
to generate a higher order cross-section with one PDF, and then have to re-run the 
program as well for the 2N (where N is the number of PDF eigenvectors) error PDFs. 


\http://hepdata.cedar.ac.uk/pdf/pdf3.html 
Whttp://apfel.mi.infn.it 
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Such a step is in fact unnecessary, and most programs have the ability to use PDF event 
re-weighting to substitute a new PDF for the PDF used in the original generation. 

For each event generated with the central PDF from the set, PDF weights, for the 
error PDFs, can also be determined. Only one Monte Carlo event sample is generated 
but 2N + 1 (e.g. 57 for CT14) PDF weights are obtained using 


W=1,Wi= f (z1, Q; Si) f (2, Q; Si) (6.71) 
f (21, Q; So) f (£2, Q; So) 

where n = 1,..., Nevents, i = 1,..., 2N +1 [280]. Any statistical error due to the finite 
event size of the original event distribution is cancelled in the ratios that determine the 
error PDF relative weights. Such a PDF re-weighting has been shown to work both 
for exact matrix element calculations as well as for matrix element+parton shower 
calculations (modulo subtle effects in regions where Sudakov suppression is large). 
The PDF error weights can either be stored at the time of generation, or can be 
generated on the fly by the program. 

Since the PDF-dependent information in a QCD calculation can be factorized from 
the rest of the hard scattering terms, it is possible to calculate the non-PDF terms 
one time and then to store the PDF information on a grid (in terms of the PDF x 
values and their uy and uş dependence). This allows for the fast calculation of any 
hard scattering cross-section and the a posteriori inclusion of PDFs and the strong 
coupling constant a, in higher order QCD calculations. The technique also allows 
the a posteriori variation of the renormalization and factorization scales. This is the 
working principle of the two programmes fastNLO [686] and Applgrid [329]. These 
programmes are greatly used for calculation of the NLO matrix elements used in PDF 
fits, and as discussed earlier in Section 6.2.1 are now being adapted for use with NNLO 
matrix elements. 


6.7 Summary 


An accurate knowledge of parton distribution functions is crucial for precision LHC 
phenomenology. In this chapter, the techniques for the determination of PDFs were 
described, as well as techniques for the determination of the uncertainties of the PDFs. 
Global PDF fits involve data from deep-inelastic scattering, Drell-Yan and inclusive 
jet production, with increasing contributions from single photon and top production. 
Previous generations of fits used only data from the TEVATRON, HERA, and fixed target 
experiments, but the copious data from Run 1 (and now Run 2) at the LHC are starting 
to have a significant impact. 

The evolution PDFs using the DGLAP equation was revisited in more detail. Evo- 
lution is the great equalizer, as differences among PDFs from different groups, and the 
sizes of PDF uncertainties, decrease with increasing Q?. In general, there is a larger 
difference between PDFs determined with a leading order framework and a next-to- 
leading order framework, with a much smaller difference when going from NLO to 
NNLO. LHC predictions carried out with LO PDFs can differ greatly both in normal- 
ization and shape from those carried out in a purely NLO framework. NLO shapes 
can often be recovered with LO matrix elements by using NLO PDFs. 
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Parton PDF luminosities were defined and a new framework (PDF4LHC15) for 
combining PDFs from the three global PDF fitting groups was described. The PDF 
uncertainties for the three PDF groups can be summarized with a limited number of 
error PDFs, as low as 30. Lastly, a number of useful PDF tools were described. 

PDFs will continue to improve as more data from the LHC is included, and as more 
crucial processes are calculated at NNLO. As for the theory, and for the LHC data, 
what is presented here is just a snapshot of a rapidly developing field. 


7 
Soft QCD 


When looking at event displays at hadron colliders, it becomes apparent that the 
perturbative picture developed so far does not yet cover all aspects of what can be 
seen. First of all, there are many events with only a few — if any — particles hitting 
the central regions of the detector, either as charged tracks or as energy deposits in the 
calorimeters. Quite often these particles are relatively soft, with transverse momenta 
at around or below 1GeV. At the beginning of this chapter, in Section 7.1, the soft 
inclusive physics underlying such events will briefly be discussed. Building on a short 
presentation of the ideas behind typical models for soft strong interactions, which are 
quite often based on Pomeron and Regge pole physics, total hadronic cross-sections and 
their parametrizations will be introduced. This will extend to elastic and diffractive 
processes, which populate the forward regions of the detector, usually with low-p_ 
particles. 

Increasing the energy scales, the next section, Section 7.2 will focus on multiple 
parton scattering. This phenomenon is closely related to the fact that hadrons are 
extended objects, containing many partons. In contrast to the usual factorization the- 
orems underpinning the perturbative machinery discussed at great length in the first 
chapters of this book, this may lead to more than one parton pair interacting with 
each other. With increasing available energies, secondary parton—parton scatters may 
start populating regions of phase space usually considered to be mainly driven by per- 
turbative physics; this is true in particular for the multiple production of hard objects, 
such as gauge bosons or jets. But even without such relatively spectacular manifesta- 
tions of this phenomenon, multiple parton scatterings contribute to the overall particle 
yield in collisions, to the overall energies of jets, etc.. This manifestation of multiple 
scattering is often called the “Underlying Event”. Models describing this part of the 
overall event structure will also be introduced in Section 7.2. 

In the penultimate section of this chapter, Section 7.3, some light will be shed on 
the transition of the partons produced in the hard interaction, the parton showering, 
and the underlying event into the observable hadrons. While to date this transition can 
quantitatively be described by phenomenological models only, some qualitative ideas 
underlying their construction could be tested, and this interplay will be discussed. 

Finally, Section 7.4 rounds off this chapter with a very brief description of the 
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technology used in the understanding or parameterization of the decays of unstable 
particles. 


7.1 Total cross-sections and all that 


In this section, inclusive quantities such as the total, elastic, and inelastic cross- 
sections are discussed. After some definitions and general thoughts on how these quan- 
tities manifest themselves in the experiment, the optical theorem is introduced, which 
connects the total cross-section with the elastic scattering amplitude and, thereby, 
with elastic scattering. This motivates a closer look at the elastic scattering am- 
plitude and its analytic behaviour. This is known as S-matrix theory and, by to- 
day’s standards, seems old-fashioned. However, the resulting Regge calculus and the 
pomeron [815, 823, 824] embedded in it still serve as the basis for nearly all parameter- 
izations of the total, elastic, and inelastic cross-sections used today. After discussing 
the properties of these parameterizations, the concept of opacity is presented, which 
helps to maintain the unitarity of the total cross-section. It will also be used in the very 
short introduction to diffraction in hadronic collisions. The section closes by connect- 
ing the Regge picture of strong interactions with perturbative QCD and in particular 
the BFKL pomeron and its relationship to the pomeron introduced previously. 


7.1.1 Setting the scene 
7.1.1.1 Definitions 


By definition, the total cross-section of particle interactions in collisions is given by 
the number of events where the initial-state experiences some interaction leading to an 
observable change. From a theoretical perspective there are three types of processes 
contributing to the total cross-section: 


1. Elastic interactions, where the particles in the final state are identical to the ones 
in the initial state, but just experience a “kick” such that their direction changes. 

2. Diffractive interactions, where the final states can be seen as excitations of the 
initial-state particles, which populate the forward regions of the detector. This new 
final state might be roughly equal in mass to the incident particle and thereby 
would be understood as an excited particle; for example, a proton could become 
one of the N* resonances such as the N (1440). But the final state could also 
have a mass decidedly different from the typical hadronic mass scale. In such 
a case a “rapidity gap”, the absence of any QCD radiation over sufficiently 
many units of rapidity, usually more than three, between this state and any other 
particle emerging from the radiation is generally thought to signal a diffractive 
interaction. From a theoretical point of view, this is connected with the exchange 
of a (hadronic) colour-singlet state interacting with the diffracted initial-state 
particle. It should be noted that quite often also single and double diffractive (SD 
and DD) events are distinguished, depending on how many initial-state particles 
experience this kind of interaction. In addition, there are also events with more 
than one rapidity gap. Probably the most prominent representatives are central 
exclusive production (CXP) events, where a heavy system is produced at mid- 
rapidity with rapidity gaps on both sides. 
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3. Inelastic interactions, where the initial state gets seriously distorted, typically due 
to sufficiently hard QCD interactions leading to a number of final-state particles 
not only in the forward direction but also in the central detector region. 


By far and large, the total, inelastic, elastic, and diffractive cross-sections increase with 
energy, where typically 


Otot > Pinel > Sel > OSD > ODD > OCXP (7.1) 


and all being of the order of 1-100 mb at hadronic centre-of-mass energies that have 
been accessed until now. 


7.1.1.2 Measurement, selection bias, and minimum bias 


In this context it is worthwhile to discuss briefly the concept of minimum bias, typi- 
cally applied to a class of events analysed at collider experiments. The idea is that, by 
construction in every such experiment, events must pass certain thresholds in order to 
be further analysed and thereby are subjected to a selection bias. In minimum bias 
events, this selection bias is reduced as much as possible. This is limited by the realities 
of detector construction and data acquisition. In most cases, modern detectors have 
an inner magnetic field to improve the tracking performance of charged particles, and 
this, together with a limited efficiency for individual components, induces a minimal 
transverse momentum for particles to be detected with appreciable probability. This 
induces a non-trivial and certainly non-negligible bias in the selection of events, which 
must be properly treated and documented. It has therefore become customary at ex- 
periments such as the ones at the LHC to define different minimum bias characteristics 
based on a number of charged tracks with a minimal transverse momentum in the 
central detector. Similar reasoning of course also holds true for other process classes, 
where in modern analyses the definitions are increasingly based on visible objects in 
the detector rather than supposed production mechanisms. 


7.1.2 Reggeons and pomerons 
7.1.2.1 Complex angular momenta 


Consider the amplitude for a 2 + 2 scattering process (A) which, supposing symmetry 
of the interaction around the azimuth, can be expanded in Legendre polynomials 
P;(cos 0) as, 


Aab+ea(s, t) = 5 (21+ 1) a(s) Pi(cos 0). (7.2) 
1=0 
Here / labels the angular momentum and the polar angle 0 can be expressed through 
the Mandelstam variables s and t as 


2t 
cos@=1+—. (7.3) 

s 
The a; in the equation above are called partial wave amplitudes, and, correspond- 
ingly, this expansion is also called partial wave expansion. It has already been 
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Contour C’ 


Contour C 


I=4 Re(I) 


Fig. 7.1 The Sommerfeld—Watson contours C and C’. 


encountered before, in Section 5.2.5, where also the connection of amplitudes and 
cross-sections through the optical theorem has been discussed in detail. 

By continuing the angular momentum l to the complex plane, it is possible to 
rewrite the summation in the expansion above as an integral, namely 


Pi, P(L1+%) 


Aab>cals, t) = wf 21 + 1) a(l, t) Sata 


(7.4) 


This is known as the Sommerfeld—Watson transformation. The contour C sur- 
rounds the positive real | axis, excluding the poles at integer values of l, as shown 
in Fig. 7.1. In order to close the contour for large l (|I| —> oo), the partial wave 
amplitudes must fulfil 


a(l, t) < exp(al) for |I| + co. (7.5) 


These poles come with factors (—1)! from the sine, thus violating the inequality along 
the imaginary axis. In order to guarantee convergence for infinitely large l, two analytic 
functions a\=+*)(1, t) are introduced, such that the integral is separately finite for 
either of them. The 7 = +1 are called “signatures” of the corresponding partial 
waves, and 
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ne iP (L, 1+ 2t) 
2 sin(al) 


Aab>cals, t) = apa (+1) >> al™(1, t)| . (7.6) 


C n=x 


In a next step, the contour C surrounding the positive real | axis is closed in the 
complex-l plane by adding a half-circle and the line —C’, running in parallel to the 
imaginary l axis at Rl = —1/2, thus ranging from l = —1/2 + ico to l = —1/2 — iow. 
The overall result equals the residues of all poles inside this closed integration path, 
labelled by ny}, thus reflecting their signature. Assuming the integral to vanish for large 
absolute values of l, |I| — 00, this leaves only the poles and the integration along C’. 
The various pieces of this manipulation are sketched in Fig. 7.1. After it, the amplitude 
reads 


a ev itl 2t 
eae ae (7.7) 
peat) Pla 1+ 3 
s 2 2 4 sient L syo) 


The new poles reside at positions œ;(t) in the complex-! plane, replacing the values | 
in the “regular” real poles and contribute with a strength according to their partial 
wave amplitudes, denoted as 6;(t). They are called Regge poles with even and odd 
signatures, depending on the value 7 = +. There may be additional, more complicated 
analytic structures in the complex-/ plane, like branch cuts, etc., but they are beyond 
the scope of this very brief introduction. 


7.1.2.2 Reggeons and pomerons 


The remarkable thing about this simple picture is that in the region of s > |t| the 
Legendre polynomial is dominated by the first term. Furthermore, at large energies, 
s — oo, the integral along the new contour C” vanishes, and only the poles drive the 
behaviour of the scattering amplitude. In this limit the pole with the largest real value 
of a(t) = a,(t) will yield the dominant contribution, 


—ina;(t) 


paul nt+e 
2 


Aav-ea(, t) Bi) sO. (7.8) 
This can be viewed as the t-channel exchange of something with angular momentum 
a(t). This “something” defies usual particle definitions, as its spin is t-dependent and 
thereby it cannot be identified with an integer or half-integer number. Such an object 
is called a “Reggeon” [815, 823, 824]. One way to look at it is to identify it with the 
superposition of amplitudes for a variety of different particles or mechanisms exchanged 
in the ¢ channel. 

However, assuming the exchange of one Reggeon only, in a factorized picture, the 
amplitude can thus be written as 
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Yalt) 


Fig. 7.2 Factorized amplitude with a Reggeon exchange in the t channel; 
the couplings y(t) of the Reggeon to the external particles a, b, c, and d 
are made explicit, and the “propagator” term behaves like s“. 


—ina(t 
wey n+ e © Yac(t) Walt) gat) 


Aab>cals, t) 5} sin[ra(t)] P(a(t)) 


(7.9) 


where the y are the “couplings” of the Reggeon to the incoming and outgoing par- 
ticles, cf. Fig. 7.2. If a(t) assumes an integer value, sin[a(t)m] will vanish, and the 
amplitude will develop a pole. For positive integers this can be understood as the 
resonant exchange of a physical particle, for negative integers the T function will lead 
to a cancellation of the contributions. This is to be compared with the exchange of 
particles with mass m and spin J = a(m?) for positive t (or in the s channel), which 
also exhibits a resonant structure at 0 < t,s = m?. This observation led Chew 
and Frautschi to plot the spins of hadrons against their masses squared, discovering 
straight lines, as shown in Fig. 7.3. Such graphs are also known as Chew—Frautschi 
plots, and they provide a motivation to parameterize the Regge trajectories a(t) 
as 

a(t) = a(0) + a’-t (7.10) 


for all values of t, and in particular for the physical region of negative t. For a more 
detailed discussion, the reader is referred to the literature, for example [548]. 

The Reggeon amplitude will be equal to the forward elastic amplitude for t + 0, 
if a = cand b = d. In this case, the optical theorem expressed in Eq. (5.131) links the 
total cross-section to such an amplitude, and therefore 


Trot X s~, (7.11) 


In the specific case of the p-w trajectory, which is related to processes where isospin 
is exchanged (AJ = 1 processes), the exponent of s is smaller than 0, and there- 
fore the cross-section decreases with increasing s. This in fact has been observed 
experimentally, and it is in line with the Pomeranchuk theorem asserting that 
the cross-sections for all scattering processes involving any charge exchange vanish 
asymptotically [790, 814]. Conversely, at asymptotically high energies, processes with 
the exchange of the quantum numbers of the vacuum dominate the cross-section [530]. 
In fact, all experiments to-date exhibit a total hadronic scattering cross-section that 
increases slowly with the centre-of-mass energy of the hadronic collision. Attributing 
this behaviour to a single Regge pole, then it must carry the quantum numbers of the 
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Fig. 7.3 The Chew-Frautschi plot of the p-w trajectory, including a fit to 
the linear form of a(t). 


vacuum: this specific Regge trajectory is called the pomeron or Pomeranchuk pole. 
The increase of the cross-section with energy means that its intercept œp (0) > 1. 

At this point, it is important to stress that reggeons in general and the pomeron in 
particular are, per se, poles or cuts in the complex plane of the scattering amplitude, 
and not particles. The identification of reggeon exchange with the exchange of a tower 
of physical particles is not neccessarily a coincidence, since physical particles can and 
will be exchanged in scattering processes, but this relation is not bi-directional: there 
may be poles/cuts that cannot be identified with known physical particles. Incidentally, 
the pomeron is a prime example for it: it cannot be related to any known physical 
particle. 

To rephrase this in a different way: the pomeron is not a particle and it thus actually 
has a nature different from Regge trajectories like the one in the example above, the 
p-w trajectory, for two reasons. First of all, to-date, there are no strongly interacting 
particles with integer spins that could serve as manifestations of the trajectory for 
t > 0 or, stated differently, as resonances in s-channel scattering. This relates to the 
fact that in a picture inspired by perturbative QCD, the pomeron is thought of as 
the exchange of gluonic degrees of freedom, arranged in a color-singlet state. The very 
existence of purely gluonic bound states, essentially hadrons made of gluons only, also 
known as glueballs, remains a subject of speculation. Secondly, in the QCD picture, 
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however, the exchange of gluons does not lead to a simple pole, but rather to a branch 
cut. This indicates that a simple particle interpretation like for other Reggeons is not 
obvious: there is just no unique relation between mass and spin, a hallmark of “proper” 
particles. 


7.1.3 Simple parameterizations 
7.1.3.1 The total cross-section in simple fits 


The total cross-section in hadronic collisions increases with the centre-of-mass energy 
of the colliding particles. Comparison with scattering data up to 13 TeV c.m.-energy 
that have been reached at Run I of the LHC and up to about 50 TeV in cosmic rays 
suggest a total proton—proton cross-section that for large energies scales like 


s E 
oer Ge) = Op ( K ) ; (7.12) 


where as usual spp = E2,., the square of the centre-of-mass energy of the incoming 


protons. In typical fits [484, 485] 
op = 21.7mb and e = 0.0808. (7.13) 


In this “Donnachie—Landshoff fit”, the exponent e typically is related to the soft 
Pomeron intercept, ap, 


eS op—l. (7.14) 


€ is usually assumed to be smaller than (ap — 1) to account for the effect of destructive 
interference from multiple pomeron exchanges. However, e together with the normal- 
ization op must be obtained from data. The pomeron, having the quantum numbers 
of the vacuum, dominates the cross-section in the high-energy limit according to the 
Pomeranchuk theorem [815]. It is important to understand that this fit is at odds 
with the perturbative, or hard, pomeron discussed in Section 5.2.5. There, the leading 
pomeron intercept was given by 


4N; log 2 
= ——a 
T 


A s © 2.5- 0s, (7.15) 
cf. Eq. (5.226), and the total cross-section for elastic gg — gg scattering assumed the 
form given in Eq. (5.229), which goes beyond the simple pole of the soft pomeron idea. 

However, these simple fits often are further extended by adding other Reggeons, 
which effectively sum over the exchange of full classes of particles such as p, w, the 
corresponding f and a mesons, see Section 7.1.2. While the dominant pomeron, due 
to its quantum numbers, is blind to whether the colliding hadrons are particles or 
anti-particles, (or, indeed protons or neutrons), the sub-leading reggeons are not. This 
is reflected in the extension 


€ -=n 
à 8 s 
o (PP: PP) = op ( w) H OR ( w) , (7.16) 
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Fig. 7.4 The total pp (blue) and pp (red) cross-sections compared to the 
simple pomeron/reggeon fits of Eq. (7.17) 


where the exponent 7 related to the reggeon and the normalization o PP PP) are also 
fitted by [485] resulting in 
_ _ J 56.08 mb for pp 
n = 0.4525 and on = i mh fot op: (7.17) 


The comparison of this fit with data taken from the Review of Particle Physics by the 
Particle Data Group [799] is exhibited in Fig. 7.4. 

Another addition that is frequently made is the inclusion of a “hard pomeron”, 
which reflects the fact that the pomeron as obtained from perturbative QCD would 
yield a completely different — and larger — intercept ap — 1 ~ 0.5 at leading order. 
This value is at odds with data, which would favour a much smaller intercept for the 
hard pomeron of about ap—1 ~ 0.25, more in line with higher-order results. However, 
there is some evidence of the hard pomeron in the structure function F> and in the 
interaction pattern of more exclusive processes. 

Finally, note that similar fitting strategies have also been applied to different reac- 
tions such as mp or Kp-scattering. It is interesting to realize that the scaling behaviour, 
i.e. the exponents in, say, the simple fit of Eq. (7.12) are identical and that only the 
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normalization changes by about a factor of 2/3.! This is the additive quark rule, 
which was crucial in establishing the pomeron picture, see e.g. [717]. 

There is another important theoretical property of the total cross-section: asymp- 
totically it cannot possibly increase faster than log”s, with s the centre-of-mass energy 
squared of the incident particles. This is the Froissart bound [547], a manifestation of 
the unitarity requirement underpinning every reasonable field theory, supplemented 
with the idea that the S-matrix is analytic. Ultimately, this bound will have to kick 
in and thereby modify the relatively simple fits above. 

Pictorially speaking, the Froissart bound ensures that the proton behaves asymp- 
totically like a “black disk”. In partonic language this means that the parton density 
is limited, thus limiting or counter-balancing the amount of parton creation at larger 
scales as driven by the DGLAP equations, Eq. (2.31). In practical terms this means 
that at large enough scales, there must emerge non-linear terms in the DGLAP equa- 
tions, which account for parton recombination effects at large densities. 


7.1.3.2 Eikonal function and cross-sections 


Re-interpreting the parameterization of the forward or elastic scattering ampli- 
tude presents one way to remedy its behaviour at asymptotically high energies, thus 
guaranteeing that the Froissart bound is respected. This is achieved by assuming that 
the elastic scattering amplitude can be expressed through a Fourier transform into 
impact parameter space 


T(s,t) = as f B et E a(s, By). (7.18) 


Denoting with ¢ the three-vector of momentum transfer such that in the high-energy 
limit 

t=? aie. (7.19) 
In this limit, the elastic scattering amplitude and, correspondingly, its Fourier trans- 
form are purely imaginary. This allows to rewrite it as, 


a(s, By) = 7 fe (-26 5) — 1 j (7.20) 


where the eikonal function or opacity Q(s, B 1) has been introduced. The trick 
is now to identify the Regge-parameterization with the eikonal function rather than 
directly with the amplitude, 


€ n 
s S S 

As, B cr en one Gere: ee 7.21 

(Ba Rp (=) eee (=z) : an 


lIn fact, [485] finds a factor of 0.63 and relates this small deviation from 2/3 to the radius of the 
pion. For the Kp total cross-section the factor is even less, which may reflect an even smaller kaon 
radius, the fact that the pomeron couples differently to strange quarks, or merely the somewhat worse 
quality of the data entering the fit. Also, of course, Reggeon trajectories containing strangeness may 
start playing a role. 
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As a consequence, the cross-section will remain finite, as the exponential of Eq. (7.20) 
will never exceed unity — the hadronic disc will therefore never become blacker than 
black. 

The relationship between the total cross-section and the elastic scattering ampli- 
tude gives rise to a number of further relations. For example, expressed through the 
eikonal, the total, elastic and inelastic cross-sections read 


tot (8) = 


s Ve 
Sals) = if eB, la(s, B| = pe 


1 — exp (1% a) 


Another interesting quantity is the elastic slope B given by 


d doei(s, t) 1 / jc tig aes Qs, B) 
B = = d“ B4 B4 |1 
(s) È (os di Ms Fe LPL exp 5) 


(7.23) 

These equations manifest the relations between total and elastic scattering induced 

by the optical theorem. There is, however, a short-coming when comparing them to 

actual data, namely the conspicuous absence of any link to diffractive processes. This 

is related to the fact that especially low-mass diffraction, a process, where one or 

two of the incident hadrons transition to an excited state, can only be explained by 

the transition between scattering eigenstates. This is worked out further in the next 
section. 


E ee GS @B, (7.22) 


7.1.4 Diffraction 
7.1.4.1 Good—Walker states and low-mass diffraction 


When discussing low-mass diffractive excitations of the incident hadrons, it is sensible 
to introduce diffractive eigenstates |¢;), which are also known as Good—Walker 
states [592]. All N physical states |w,), both the excited and the elastic ones, i.e. 
the ones, where the outgoing particles are identical to the incoming ones, can then be 
written as linear combinations — coherent sums — of these |¢;), 


N 
yj) = Do aji |i) - (7.24) 


Assuming both the Good—Walker states and the physical ones to be orthogonal and 
normalized, 
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N 
(Gilek) = bie and J` janl? = 1, (7.25) 


i=1 
implies that the matrix formed by the aj; is unitary. The elastic amplitude of the 
incident particle |Y) = |1) is therefore given by 


(w|7]¥) = 5 leu ats (7.26) 


the average over the diffractive eigenstates. Here it has been used that the scattering 
operator T is diagonal in the basis spanned by the Good—Walker states. Therefore the 
elastic cross-section, given by the amplitude squared, is proportional to the square, 


oa x (TY. (7.27) 


In contrast, the amplitude for the diffractive production of any other state |Wxz1) 


reads 
(a 


which upon squaring and summing over all states k becomes 


5 (v 7 ve) (Yr 7 v) = 5 O14 Oj, 51 OR; TT; 
k ijk 
= X aay, TT} bi = (7?) i 
ij 


To have the diffractive component only, the elastic component must be subtracted. 
The interpretation of this is that the cross-section for the transition to any excited 
state, for diffraction, is given by the fluctuations 


t! v) = X auo} Th, (7.28) 


(7.29) 


Odiff.exc. & (7?) a or (7.30) 
7.1.4.2 Cross-sections with opacities and Good—Walker states 


Introducing such diffractive eigenstates implies that the opacity Q depends on the 
eigenstates that scatter. The cross-sections of Eq. (7.22) must be expressed in terms 
of the now eigenstate-dependent opacities Q; and the expansion coefficients a; and 
ar of the incident particles: 


On (Y, B 
orot(Y) = 2 | eB, N Jail? Jaxl? h exp ( Oe 2) 


[284E los? aul? fi- exp -2G 


i,k 


II 


OalY) 
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One(Y) = pen 2 losl la [i-e (env B.))| . (7.31) 


The c.m.-energy squared s of the incident hadrons has been replaced by a rapidity 


S 
ae 
Mhad 


Y = log (7.32) 


The differential elastic cross-section can be obtained from a Fourier back-transform, 


doa(Y) _ 1 f 2 iĝo Bi jai 2 Uuk(Y, B1) 
EE d“Bı <e 2 jail |ax|" |1 — exp 5 
(7.33) 


Expressed through the opacities Qig(s, B 1), single and double diffractive differen- 
tial cross-sections can be obtained similar to the elastic one above as 


doatsp,(Y) _ 1 2). 2). 12 
Se = ae È (el lo losl 
ij, 


x fë, o (a 3.) h exp ( Patr =) 
x fem, exp (ia Bi) h exp ( a 2oy) 


(7.34) 


2 


for the combination of elastic scattering and diffraction of incident beam particle 1 
and similar for a combination of elastic scattering and incident particle 2. 


7.1.4.3 Pomerons and diffractive processes 


Before discussing how pomerons and diffraction are tied together, it is worthwhile to 
remember that the pomeron exchange actually parameterizes the total cross-section. 
Going back to the optical theorem, Eq. (5.131), it is clear that the imaginary part of the 
elastic amplitude, i.e. the imaginary part of the amplitude for ab > ab is proportional 
to the total cross-section, which in turn is given by the sum over all final states n. 
This is schematically depicted in Fig. 7.5. A simple QCD interpretation of the pomeron 
follows immediately: on the level of amplitudes, and at lowest order in perturbation 
theory, one could identify the pomeron with the exchange of a single gluon. This is 
also known as the Low—Nussinov pomeron [735, 787],? see also Fig. 7.6 for a pictorial 
representation. In this case, the pomeron is “cut” into two single gluon exchanges, for 
the amplitude and its complex conjugate. In Section 5.2.5, this simplistic picture has 
been augmented with higher orders. 


?The pomeron intercept in this model is given by ap(t) = 1 leading to a constant cross-section. 
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Fig. 7.5 The relationship between pomeron amplitude and total cross- 
section. The vertical dashed line indicates either that the imaginary part of 
the amplitude has to be taken, or it symbolizes a sum over all final states, 
n. 


Fig. 7.6 The Low—Nussinov pomeron: in this model the pomeron is asso- 


ciated with the exchange of a single gluon. In the sketch, the thick blobs 
represent the gluons interacting with any of the valence quarks of the in- 
coming hadrons. 


It is advantageous to classify diffractive processes, in order to see their connection 
with pomeron exchange. First of all, there are low-mass single-diffractive or double- 
diffractive processes. These are essentially thosee processes that are directly captured 
by the transition between different Good—Walker states. They are identified with the 
exchange of a colour singlet between the two beam particles, on the amplitude level. 
The idea underlying this identification is that the exchange of colour, for example by a 
gluon, would lead to the radiation of coloured secondaries from this t-channel particle, 
which would result in the mostly soft production of hadrons filling the large rapidity 
interval between the two beams. 

There are, however, also processes where the mass of the diffractively produced 
system is not small, or where it is not produced at forward rapidities. They thus 
do not manifest themselves as excitations of the beam particles, although also in 
such cases the absence of particles in the rapidity interval separating the diffractively 
produced system from the other particles is thought of as exchange of a colour singlet. 
In both cases, these colour singlets are readily identified with pomeron exchange on the 
amplitude level: in the Low—Nussinov picture this would be realized by the exchange of 
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Fig. 7.8 High-mass single diffraction (left) and central exclusive produc- 
tion (right). Note the occurrence of a triple-pomeron vertex on the left 


plot, which plays an important role also in modelling the recombination of 
partons. 


two gluons forming a colour singlet on the amplitude level — the pomeron thus is not 
cut in such cases, see Fig. 7.7 for the case of low-mass diffraction. High-mass diffraction 
or central exclusive production processes can thus be thought of as a combination of 
cut and uncut pomerons, as exhibited in Fig. 7.8. In this case, a triple-pomeron vertex 
appears. 

It is interesting to note, however, that in some phenomenological models pomerons 
are thought of as objects with particle-like properties, like, e.g., an internal structure 
that could be resolved in a way similar to that of “usual” hadrons. In particular, in 
these models processes such as single diffraction or central-exclusive production are 
identified as the t-channel exchange of pomerons as physical particles, which are then 
subjected to a hard interaction with a quark or gluon or with each other. Contact to 
QCD is then made by defining an (equivalent) pomeron flux or similar, accompanying 
the incident hadrons, and a pomeron structure function, effectively parton distribution 
functions for these objects. While to some degree these models seem to work, they are 
of course essentially nothing but effective, albeit simplistic, parameterizations of a 
much more complicated mechanism; the basic problem with them is that the pomeron 
is just not a particle. In fact, in perturbative QCD the pomeron is not a simple pole, 
but rather a branch cut in the complex plane, which in turn renders any interpretation 
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of the pomeron as a particle an overly naive and physically wrong idea. 


7.2 Multiple parton interactions and the underlying event 


The focus of this book is mostly on the determination of cross-sections and other 
observables at hadron colliders through perturbative methods that rest on the factor- 
ization theorem, see also Sections 2.1.1 and 2.2.1. Essentially, this theorem states that 
in perturbative calculations of well-defined observables, the problem can be factorized 
into a “hard”, perturbative, and a non-perturbative part. In the former, amplitudes for 
the transition of incoming partons to the final state are calculated with perturbative 
methods from first principles. This has been elucidated in detail in Chapters 3-5. 

In contrast, the non-perturbative part deals with the question of translating the 
incoming physical hadrons into the partons, on which the full calculation rests, and 
on the translation of the outgoing partons back into the hadrons, on which all physical 
observations base. For both effects, there are no qualitative methods based on first 
principles. The factorization theorem accounts for the former translation, incoming 
hadrons to partons through the parton distribution functions. The PDFs, essentially 
being non-perturbative in nature, have to be taken from data, with methods discussed 
in Chapter 6. As the PDFs basically account for finding one parton in each incoming 
hadron, this begs the question what happens with the rest of the hadron in an event? As 
the partons taking part in the hard interaction do extract quantum numbers from the 
hadrons, not least colour degrees of freedom, there must be some mechanism of colour 
and flavour compensation. This break-up of the beam particles and the formation of 
the beam remnant particles must be non-perturbative and as such is subject to 
heavy modelling based on a few simple principles. 

But hadrons are extended objects which consist of more than one parton — if they 
consisted of one parton only, no PDFs would be necessary! This implies that there is 
a chance that more than one parton from each side enters the interaction, rendering 
the picture considerably more complicated. Such processes, with more than two in- 
coming partons and more than one scatter, are called multiple parton interactions 
(MPI); this term specifically refers to the interaction of more than one parton pair to 
contribute to the final state, in contrast to rescattering processes, where partons 
entering or leaving the hard process interact with the beam remnants or each other 
at time scales much larger than the typical time scales for the hard scatter. Issues 
related to the break-up of incident hadrons and multiple scattering in the process are 
the subject of this section. Turning the outgoing partons into hadronic bound states 
is usually described by phenomenological models, discussed later in this chapter in 
Section 7.3. 


7.2.1 Beam remnants 
7.2.1.1 Flavour degrees of freedom 


In this section, different aspects of the break-up of the incident hadrons in a hard 
scattering will be discussed, many of which can be formulated in the most convenient 
way in the context of full event simulations. 
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First of all, and most importantly, quantum numbers such as momentum, flavour, 
and colour must be conserved. For the latter, the limit of infinitely many colours is 
usually employed, which ensures that for every colour there is a unique colour partner, 
carrying the anti-colour. This assumption impacts on both the modelling of the beam 
remnants and the hadronization, where the latter is driven by the colour-singlet 
structures formed in previous stages of the event simulation. 

To illustrate this while keeping things as simple as possible, imagine an event at 
the LHC, where a W boson decaying into a lepton—neutrino pair was created, ud > 
W+ — €*v. Farther assume that the quarks did not undergo any parton showering 
or that they radiated only gluons. Then, the two quarks, the u and the d must be 
extracted from the incident protons. Knowing their Bjorken-x at the cut-off scale of 
the initial-state parton shower defines how much momentum is left for the proton 
remnants, which will consist of other partons. These remnant partons will in turn also 
compensate for the flavour and colour degrees of freedom, ensuring that these quantum 
numbers are also conserved. In the case of the u quark extracted from a proton this is 
straightforward: naively, the protons have |wud) as valence quark flavours. Extracting 
a u therefore just leaves ud as the flavours of the corresponding remnant. Using the 
notion of diquarks as carriers of baryon number, discussed in more detail in the context 
of hadronization models below, cf. Section 7.3.1, this remnant will therefore consist 
of a (ud) diquark. It will be assigned the anti-colour of the u quark colour and will 
take the full remaining momentum, thereby fulfilling the requirement that the parton 
configuration replacing the proton is colour-neutral and carries all of its momentum. 

A quick comment is in order here. Frequently in Monte Carlo simulations there are 
cases, where the three constitutent quarks have to be distributed over a diquark and 
a quark. The most naive way of achieving this would consist in merely giving each 
combination the same probability — in such a case the only small subtlety is that the 
diquarks come as spin-0 and some of them possibly also in spin-1 states. Alternatively, 
in cases where the original |wud) configuration of a proton is still available, one could 
use the proton wave-function in terms of quarks and diquarks [727], 

d l d z d : d 7.35 
p) = luud) = Se |upl(ud)o) + g ud) + g d) (T.35) 


This is how the more intricate case of the incident d would be handled — extracting 
d from |wud) leaves |uud + d) as the flavour content of the proton, which must then 
be decomposed into one diquark, carrying the baryon number, and two quarks. One 
of the quarks must carry the colour matching the d’s colour, while the other quark 
and the diquark will form another colour singlet. Assuming the most likely outcome 
for the flavour structure according to Eq. (7.35), a (ud)o diquark and an u quark 
will be formed in addition to the additional, flavour-compensating d. This leaves the 
task of distributing colours, answering the question if the u and the d or the d and 
the d form a colour-singlet. Again, different solutions present themselves, ranging from 
equal probabilities to a picture, where the dd stem from a gluon splitting, thus forming 
a colour octet. In this picture, the d and the (ud)9 and the u and the d are colour 
singlets. 

Replacing the u and d quarks in the simple example above with, say, s¢ + W~ does 
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(ud) 


Fig. 7.9 Example colour structures of some qq — W events: The left 
panel corresponds to the case of ud — W* with a gluon emission in ini- 
tial-state parton showering, while the right panel corresponds to the case 
of sc — W~, where the € is replaced by a gluon and an outgoing c quark in 
the initial-state showering. Colour connections are indicated by the short- 
-dashed lines. 


not alter the picture dramatically. In fact, following the initial-state parton shower to 
lower and lower scales, at some point the charm threshold is crossed and as a result 
the č quark will have become a gluon. In such a case, say an s quark and a gluon 
will be extracted from the incident protons. The first proton remnant will consist of 
the flavours |wud + 5), translating into one of the quarks forming a singlet with either 
the diquark or the 5, and the respective left-over 5 or diquark carrying the colour of 
the s quark. Assuming that the ss-pair had emerged from the splitting of a fictitious 
gluon below the scale of the dissociation will fix the colours such that the s quark will 
form a singlet with the diquark and the remaining quark will form a singlet with the 
5. For the proton, where the gluon is extracted, the structure is simpler. The flavour 
content of the proton remnant will still be |wud). The gluon carries a colour and an 
anti-colour, which will be compensated by an corresponding anti-colour assigned to 
the diquark and a corresponding colour assigned to the quark. The quark therefore 
will form a singlet with the colour-partner of the s quark in the other proton, while 
the diquark will be colour-connected to the c quark emerging from the g — cé splitting 
that occurred in the initial-state parton shower. For a pictorial representation of the 
two examples discussed, cf. Fig. 7.9. 


7.2.1.2 Momenta in hadron dissociation 


Having presented some relatively naive ideas of how flavours and colours are distributed 
in the break-up of incident hadrons, there is one last point to be discussed, namely 
the momentum degrees of freedom. Up to now, the overall transverse momentum of 
the system produced in the hard collision — in the examples above the W bosons 
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produced in ud + W+ and sč > W7 — are entirely given by the parton shower. In 
the first example, where the relatively unlikely case of no emission was assumed, the 
W= boson therefore would have zero transverse momentum. Events like this therefore 
would lead to a visible spike of the pı distribution of the W boson, or of Drell-Yan 
pairs in the case of qq — Ll events, at pı = OGeV. This of course is not what is 
being seen experimentally, where the p, distribution apporaches zero for vanishing 
transverse momentum, and then increases from there until it reaches the Sudakov 
peak, which in the case of vector bosons at the LHC is located at about 5 GeV. 
Therefore, there must be some source of relatively small transverse momentum for 
such systems, beyond the parton shower. 

It is quite straightforward to assume some kind of “Fermi motion” of the partons 
inside the incident hadrons. In the case of collisions with hadrons in the initial state, 
this would manifest itself as some additional intrinsic or primordial transverse 
momentum (intrinsic or primordial k,) that the beam partons assume in the 
break-up of the hadron. 

This presents another non-perturbative effect, similar to the soft form factor of 
Qr resummation first encountered in Eq. (2.172) and further discussed in Chapter 5. 
And similar to the parameterizations used there, it is customary to employ some rela- 
tively simple form for the generation of the intrinsic k], like, e.g., a simple Gaussian, 
supplemented with a cut-off to prevent the generation of too large transverse mo- 
menta. The parameters of such functions must then be fitted to data, usually the low 
transverse momentum region of the lepton-pair in Drell-Yan processes. 

In the dissociation of the second proton into partons, its momentum also has to 
be distributed. This is yet another issue where no first principles are available and 
simple ideas are invoked to guide the modelling. One of these ideas would be to select 
momenta — or, analogously, the Bjorken-x — of the quark and gluon partons emerging 
in the hadron break-up according to the PDF at the small parton shower cut-off scale 
O (1 GeV). The remaining momentum would be associated with the diquark degrees 
of freedom. 

Even more sophisticated models can be devised for the break-up of the incident 
hadrons, improving the way, flavours, momenta, or colours are distributed over the 
outgoing quarks, see for example [855]. But any model will become more involved 
when the underlying event is added to the simulation. 


7.2.2 The underlying event 
7.2.2.1 Definition 


In events with a hard interaction, described through perturbation theory in the usual 
framework of factorization, the underlying event adds further activity to the overall 
deposit of particles and energy in the detector. This introduces some ambiguity in the 
definition of what the underlying event actually is. The answers range from “everything 
apart from the hard scatter itself, but including parton showering in both the initial 
and final state plus the corresponding activity from hadronization, hadron decays, 
etc.”, to “the additional activity after all Bremsstrahlung, hadronization, and hadron 
decays related to the hard signal interaction has been taken into account”. The latter 
is the definition that will be used in this book. Correspondingly, the underlying event 
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consists of all contributions associated with the additional scatters in multiple parton 
scattering, their fragmentation, and effects coming from remnants. 

It is remarkable that, even with this relatively restrictive definition, the activity 
attributed to the underlying event is significantly higher than in soft collisions, usually 
associated with minimum bias events, at the same energy. In addition, fluctuations in 
particle multiplicity and energy flows are significantly larger in the underlying event. 
A simple interpretation is that with increasing scales being probed in the collision, 
the relative impact parameter of the two incident hadrons must decrease — in other 
words, by biasing the event selection towards larger momentum scales, the overlap 
between the colliding hadrons becomes larger, offering other partons a higher chance 
of interactions. This leads to an effect known as the jet pedestal effect: the hard 
objects sit on a “pedestal” of underlying activity, which becomes larger with increasing 
scale of the hard process. 


7.2.2.2 Evidence for the pedestal effect 


A traditional example for this is the increase, w.r.t. minimum bias events, of activity as 
a function of the transverse momentum of the hardest charged track in an event. This 
has been quantified for the first time by the CDF experiment at the TEVATRON [127, 
526]. Looking at Drell-Yan production at the LHC, however, an even more striking 
example for this pedestal effect emerges. To see how this works, assume the Drell- 
Yan pair — effectively a Z boson decaying into leptons — to have some transverse 
momentum, p%. The generation of this transverse momentum is predominantly driven 
by the emission of a hard parton in the opposite direction, and in most events with a 
sufficiently sizable pe one would thus find a jet back-to-back with the Z boson. This 
allows to define three regions of equal size in the transverse plane: 


e a “towards” region with |A¢| < 60°; 
e an “away” region with |Ad| > 120°; 
e and two parts of one “transverse” region with 60° < |Ad| < 120°, 


where the orientation is such that the Z resides at ø = 0°. By far and large, QCD 
particles from the hard scatter that produced the Z boson will be associated with 
the jet system, which is concentrated in the away region. In contrast, there will not 
be much of such primary QCD activity in the transverse region and even less in the 
towards region. A sizeable and even dominant fraction of particles in these regions will 
stem from secondary activity, the underlying event. 

This has been analysed in more detail by the ATLAS collaboration at Run I of 
the LHC [28] with spp = 7TeV, where the overall activity was measured through 
charged particles with |n| < 2.5 and py > 500MeV. In Fig. 7.10, data for the 
number of charged tracks Nen as a function of the Z transverse momentum pe in 
the transverse and toward region are compared with results from the different event 
generators. Particle production increases with pe, and the overall activity is higher 
than in minimum bias events at the same energy. The same trend is also visible in 
the sum of the transverse momenta of the charged tracks ` p; in both regions, as 
displayed in Fig. 7.11. It is remarkable that there are not only more particles with a 
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Fig. 7.11 Sop. vs. pf at spp = 7TeV in the transverse (left panel) 
and towards (right panel) region, as measured by the ATLAS collabora- 
tion [28]. Charged tracks with |n| < 2.5 and py > 500MeV are consid- 
ered. Reprinted with permission from Ref. [28]. 


larger total transverse momentum, but that they also become harder — their average 
transverse momentum increases as well, cf. Fig. 7.12. 


7.2.2.3 Consequences of multiple parton interactions 


A phenomenologically relevant effect of such multiple interactions is that they increase 
the overall final-state activity of events. Despite many of these secondary scatterings 
happening at relatively low scales, well below the transverse momentum scales usually 
associated with jets, they visibly increase the total energy released in the form of 
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Fig. 7.12 (p1) vs. pí at spp = 7TeV in the transverse (left panel) 
and towards (right panel) region, as measured by the ATLAS collabora- 
tion [28]. Charged tracks with |n| < 2.5 and pı > 500 MeV are consid- 
ered. Reprinted with permission from Ref. [28]. 


final-state particles through their multiplicity. One of the reasons for this increase is 
that they also alter the overall colour flow of the event, adding more colour sources, 
and thus providing many more directions along which soft emissions can proceed. This 
ultimately translates into many more seeds of hadron production. 

Another effect is directly related to jets. From previous considerations, it is clear 
that QCD final-state radiation carries away energy from the primary hadrons, which 
may end up outside the jet. For relatively simple, cone-shaped jets with a radius of R 
this yields a contribution that is roughly proportional to log(1/R), coming from the 
integral over opening angles. Hadronization corrections, discussed later in this chapter, 
scale like 1/R, and also contribute negatively to the overall jet energy, as some of the 
hadrons emerging may end up outside the jet. On the other hand, the underlying event 
adds energy back to the jets, usually in proportion to the jet area, x R?. For more 
details, cf. [432]. 


7.2.2.4 Simple models for the underlying event 


At the moment models for the underlying event fall, broadly speaking, into two cate- 
gories: first of all, there are models based on simple parton-parton scattering, imple- 
mented in standard event generators such as PYTHIA, HERWIG, or SHERPA. Alternative 
models based on Regge-theory [598] and employing the notion of cut pomerons form 
the basis of the underlying event and minimum bias modelling in event generators 
such as PHOJET [514] and Epos [889]. 

Concentrating on the former, more simplistic class of models first, the logic un- 
derlying their construction is that the cross-section for parton-parton scattering with 
transverse momentum larger than some cut-off Pi min, 7232(P1,min) becomes larger 
than the total proton-proton cross-section Opp tot; 
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dô 
02-42(P 1 min) = J dp? Sas = Opp,tot - (7.36) 
PI 


P? mi 
Femin PL,min®%5 GeV 


Here, 62-42 is given by Eq. (2.52), where the matrix element squared |Map-+n|? is 
given by the sum of all partonic 2 + 2 QCD scatters at leading order. At the LHC the 
saturation of the total cross section occurs for values of Pi min of the order of about 
5-10 GeV, depending on the c.m.-energy of the protons, on the PDFs being used for 
the parton-level calculation, and on the choice of renormalization and factorization 
scales. 

The interpretation of Eq. (7.36) is pretty straightforward: if the partonic scattering 
cross-section is larger than the total inelastic or, most probably the non-diffractive 
(ND) hadronic cross section, then there must be more than one partonic scatter per 
hadron interaction, 


o22(PLmin) > 4 (7.37) 


09-42(P 1 min) > Opp, ND —> (Necatters(P1,min)) = ee 
PP, 


This presents the starting point for a class of relatively simple models for the 
underlying event. In these models the underlying event emerges as the superposition 
of largely independent parton-parton scatters, with two partons that are oriented back- 
to-back in the transverse plane. The number of these scatters is distributed according 
to a Poissonian distribution defined by (Ngcatters(P1min)) in Eq. (7.37), but possibly 
reduced by one, if the hardest partonic event — the signal event — is a QCD event. 
The number of actual scatters Nscatters is either predetermined by a Poissonian, as in 
JIMMY [298], the model implemented in the HERWIG family of event generators, or it is 
generated dynamically, as in the model [857] realized in the PYTHIA event generators. 
The basis of this latter model is a structure that looks like a Sudakov form factor 
and, analogous to the case encountered already in the construction of parton showers, 
yields the probability of no further scatter to happen between a higher scale Q? and 
a lower scale t > pf min: 


Q? 
1 dé 
AVE) (Q?, t) = exp |- fo ol a 7.38 
(Q”, t) EE Praa (7.38) 


Equating this with a random number allows the transverse momentum squared t, at 
which the next scatter appears, to be determined, implying an ordering in a hardness 
scale given by pł. Once the p, of the scatter is fixed, also the Bjorken—x of the 
incoming particles (or the rapidity of the overall system) can be selected from the 
differential parton-level cross-section déo_,2/ dp? . 

This naive treatment has an unpleasant implication, namely a steep dependence on 
the value of pı min, driven by the divergent structure of dô2—2/dp} œ 1/ pi or worse 
for small values of pı . To cure this problem, a phenomenological ansatz is often used, 
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namely replacing p? with (p3 + PŽ o) This also allows the elimination of p min, as the 
partonic cross-section is suitably regularized. Focusing on the approximate behaviour 
of the differential cross-section with respect to p? and keeping only these terms, this 
implies a reweighting of the differential cross-section Eq. (7.38) by a factor 


as(pi +Pio) mt 
as (p* ) (pi. +P} o)? 


(7.39) 
Comparison with data suggests that this new parameter, pio scales with the c.m.- 
energy of the colliding hadrons, in a way similar to the total cross section: 


E 
Fres 


p1 (E) = ( ) Puo(Es) (7.40) 


Here Eyer is some reference scale and the exponent 77 is related to the pomeron intercept 
driving the rise of the total hadron cross section. 

This relatively simple way of generating the series of 2 — 2 parton scatters forming 
the underlying event can be further modified by assuming a distribution of partons 
in the incoming hadron. This means that the PDFs entering the simulation become 
also dependent on an impact parameter b, basically the distance of the two colliding 
hadrons. Usually it is assumed, though, that the impact parameter dependence is on 
average only, and therefore there are no “positions” in transverse space associated 
with the individual scatters. In addition, it is assumed that the impact parameter 
dependence factorizes from the PDFs such that 


fijn (ex HF; z) Fishes (x. HF; z) = fijam (1, HF) fj/ha (£2, HF) A(b), (7.41) 
where A(b) denotes a matter overlap function. Different parameterizations are used 
for A(b), including single and double Gaussians and forms that are inspired from the 
usual electromagnetic nucleon form factors. In some recent publications, these overlap 
functions have also been assumed to depend on the Bjorken—x of the partons [419]. 
What they all have in common is that the number of scatters Nscatters increases with 
the overlap A(b). An important consequence of adding the impact parameter is that the 
b-integrated distribution of Nscatters becomes broader — in other words, the underlying 
event model supports larger fluctuations, in agreement with data. 

The individual partonic sub-events undergo parton showering, which will decorre- 
late the two outgoing partons in the transverse plane and also contribute to the overall 
yield of hadrons produced. One way to impose four-momentum conservation on the 
event is to reduce the total energy of the incident hadrons by the amount carried away 
by the initial partons after the parton shower of each partonic sub-event has termi- 
nated. This is the logic underlying the models in PYTHIA and SHERPA, where it is 
assumed that the PDFs for the beam hadron after some partons have been extracted 
are just the PDFs of the same hadron, but with reduced energy. Alternatively one 
could also just stop the generation of additional scatters if they exceed the total en- 
ergy of the hadronic system, possibly even removing the last parton scatter, which is 
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the algorithm in HERWIG. More complicated models introduce correlations between the 
sub-events beyond this simplest four-momentum and flavour conservation and thereby 
possibly also modify the PDFs, cf. for instance [855]. 

Further details that must be taken care of in these simplest models are the colour 
flows between the individual partonic sub-events, which will influence the production 
of hadrons during hadronization. While infrared-safe observables such as energy flows 
etc. will remain more or less unaltered by hadronization, hadron multiplicities and 
the soft part of their energy and p,-distributions are highly sensitive to this. This 
adds yet another dimension of model building and assumptions into the simulation, 
and consequently various models differ quite a lot. The answers here range from a 
completely random assignment of colours, with the only constraint of overall colour 
conservation to models, in which the overall “length” in -¢ space of distances between 
colour-connected pairs of partons is minimized. 

The model presented here has to be taken with more than one pinch of salt. Due 
to the low scales probed — down to P1 min — also the range of Bjorken—x extends to 
fairly low values of the order of 1076, which in turn introduces a sizable dependence 
on the parton distribution function used in the calculation. This is just one indicator 
of a more generic problem: any fixed-order calculation probing such scales becomes 
unstable, not least due to the emergence of large logarithms of the BFKL type. It 
therefore cannot be over-stressed that models such as the ones presented here certainly 
are overly simplistic and are at best only able to give some qualitative ideas about the 
true physics. 


7.2.2.5 Including rescattering and more 


In the latest PYTHIA versions, PYTHIA 6.4 and PYTHIA 8, the simple model outlined 
above has been enhanced by two additional ideas. 

First of all, “interleaving” of the multiple parton scatters with the parton showering, 
and, in particular, initial-state parton showering, has been introduced, which basically 
amounts to a competition between both. To facilitate this, a combined no-emission 
probability has been introduced, which schematically reads 


2 


Q 
dP(Q?, t) dPpgs | dPmpr I 2 {dPpgs  dPmrPI 
= . —fd ; 7.42 
dpi ay ap) OP j E a AA a 


Naively, this seems to factorize and therefore to not alter the pattern of parton emis- 
sion through the parton shower (indicated by subscript “PS”) and through multiple 
parton interactions (“MPI”). However, the effect of flavour and especially momentum 
conservation will change the patterns, since both parton showering in the initial state 
and multiple parton scatters take energy out of the incident hadrons, and therefore 
the cross-talk impacts on the individual parts. As a by-product of this idea, it becomes 
possible that two parton scatters, that have been independent from each other when 
they were generated at relatively high p1 -scales, are found to point to a common “an- 
cestor” parton in their initial-state evolution, see also the left panel of Fig. 7.13. This 
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Fig. 7.13 Sketch of possible improvement of simple models for the under- 


lying event, available in recent versions of PYTHIA: Two scatters can have 
a common “ancestor” parton (left panel), or there can be rescattering ef- 
fects (right panel). In both cases, parton showering effects and emissions 
of secondaries have been ignored. 


is the manifestation of recombination effects in the DGLAP evolution of multi-parton 
distribution functions [679, 690, 845, 861]. 

As a second effect, final-state rescattering has been implemented in PYTHIA and 
studied in [418]. Rescattering essentially amounts to those processes, where one of the 
outgoing partons of one parton scatter acts as an incoming parton for another scatter, 
cf. the right panel of Fig. 7.13. 


7.2.3 Double parton scattering 
7.2.3.1 A simple model for double parton interactions 


The simplest of multiple parton scattering processes is hard double parton scatter- 

ing, processes with two hard scatters, where four incoming partons interact pairwise 

and create corresponding final states. There has been a number of proposals with vary- 

ing degrees of sophistication to access such processes from a theory point of view. Most 

of them based on the idea that the differential parton level cross-section 6x+y for the 

production of a composite system X +Y can be written as [153, 588, 637, 759, 800, 801] 
ain, m dad g dag 


dôx+y = dôf" y + o (7.43) 


where the superscript 7" denotes direct production in one two-parton scattering and 
Ceg iS a process-independent parameter with the dimensions of a cross-section. The 
factor m before the double-parton scattering contribution is used to obtain the correct 
symmetrization; it is m = 1 for X = Y and m = 2 otherwise. The ® symbol in the 
ratio and the usage of differential cross-section hints at the necessity to apply identical 
cuts on the final state and on possible correlation effects in the two-parton PDFs. 

A number of subsequent refinements and alterations include, among others, at- 
tempts to connect Cep with total hadronic cross-sections or the geometric size of the 
hadrons [637, 742], a scaling of this quantity with the scale at which the hadron is 
being probed, and some naive inclusion of correlation effects in two-parton PDFs. 
The latter can be achieved, for instance, by writing the two-parton PDFs as a simple 
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Jonni T2; Ln) = (1 A £2) fpı/h(£1; Wr) fpa/h(£2; ui) (7.44) 


of the conventional single-parton PDFs [153, 637]. This simple picture has been re- 
fined by considerations concerning various types of correlation effects [464, 465, 748], 
by invoking DGLAP-type evolution equations for parameterizations of multi-parton 
PDFs [679, 845, 861, 903], including also different resolution scales [832, 833], and by 
the use of generalized two-parton distribution functions, introduced in [258-260]. 
With the running of the LHC a proper description of DPS has found renewed 
interest, as documented in a wide range of publications, which typically go beyond 
the simplistic picture outlined above. In any case, any models for multiple parton 
scattering must be compared with suitable data, like, for instance [89, 109, 113] from 
the TEVATRON and [17, 379, 387, 656] from the LHC. It is fair to state here that any 
further development of such models and probably also any serious attempt to arrive 
at a first-principles based theory will necessitate further measurements of such effects. 


7.2.3.2 Typical processes 


Regardless of the accuracy of the naive expression in Eq. (7.43), it at least roughly 
predicts the actual size of double-parton scattering cross-sections. Measurements to- 
date point at values of the phenomenological parameter oeg of the order of 10-20 mb, 
about 20% of the total hadronic cross-section. This immediately implies that double- 
scattering is observable only in those processes, where 69!" x 69" ~ oep x ôf" y, the 
product of the production cross-sections for the single systems being of the order of 
the product of hadronic cross-section and the direct production cross-section for the 
combined system. This leads to combinations of individual processes, where single weak 
gauge bosons or pairs of jets are produced, in particular processes where {X}+{Y} is 
for example given by {V}{V'}, {V}{j7}, tas} {i}, or {77} {iy}, most of which have 
been considered in the literature. In addition, DPS has been investigated in rare QCD 
processes, like for instance the production of J/w pairs. In order to systematize the 
considerations, in Table 7.1 a variety of relevant cross-section estimates at next-to- 
leading order are tabulated. 

Cuts on the final-state particles can further improve the ratios of DPS and direct 
production cross-sections. One example for such a cut relies on the assumed kinematics 
of DPS processes, being composed of predominantly uncorrelated single parton scatter 
processes in one hadronic collision. The idea there is that, by far and large, systems 
produced in a single-parton scatter experience a very limited “kick” in transverse mo- 
mentum space only, as the generation of additional transverse momentum has to be 
ascribed to further emissions. For example, for Drell-Yan type processes, the produc- 
tion of weak gauge bosons, the peak of the transverse momentum distribution and its 
mean are well below 10 GeV. In other words, two particles produced in a hard single 
scatter, such as the lepton pair in the example here, tend to be relatively well balanced 
in transverse momentum space, 


a 
M+ =o x o, (7.45) 
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Table 7.1 Indicative cross-sections for direct and DPS component of typ- 
ical process combinations at an 8 and a 13 TeV LHC. All cross-sections 
are evaluated at next-to-leading order with the central set of the CT14 
NLO PDF [489], and with renormalization and factorization scales equal 
to the scalar sum of the transverse momenta of the final-state particles, Hr 
The gauge bosons decay into one (different) family of leptons each, so that 
branching ratios have been factored in. Jets, including b jets, are defined in 
the anti-kr (D = 0.4) algorithm with a transverse momentum of 20 (30) 
GeV at 8 (13) TeV; to avoid the well-known problem of diverging di-jet 
cross-section for identical p1 cuts, for the jj process the leading jet must 


have a transverse momentum of pP > 25 (35) GeV. Photons are subject 
to the same transverse momentum cuts as the jets. A value of oog = 15 
(20) mb is assumed for the two energies. 
LHC at 8 TeV LHC at 13 TeV 
Process ôt" [pb] PPS [pb] ôt" [pb] GPPS [pb] 
wt 6.66 - 10% - 11.0- 10° -= 
W- 4.72 - 10 — 8.20 - 108 = 
Z 1.06 - 10% S 1.82 - 10° — 
jj 0.173 - 10° — 0.086 - 10° = 
TY 96.7 - 103 = 46.9 - 103 = 
bb 0.57 - 10° - 0.27 - 10° = 
Wtwt || 14.1-10-3 | 2.96.1073 ]| 32.6.1073 | 6.05.1073 
W-W- || 5.39.1073 | 1.48 -1078 14.5- 1078 | 3.36- 107 
WW- 602 - 1078 2.10 - 1078 1250 -1078 | 4.51 -1078 
ZZ 16-1073 | 0.074- 1073 34-1073 0.17 - 1073 
ZW+ 47-1073 0.47 - 1073 96-1073 1.00 - 1078 
ZW- 26-1073 0.33 - 107% 60-1073 0.75 - 107% 
Wtj7 || 0.384-10% | 0.077-10 |] 0.410-103 | 0.047- 103 
W- jj 0.259- 10° | 0.054- 10° 0.289 - 10° 0.035 - 10° 
Zjj 0.074: 10° | 0.012- 10° 0.083-10 | 0.0078 - 103 
Wr bb 1.97 0.25 1.80 0.15 
W- bb 1.17 0.18 1.12 0.11 
Zbb 1.26 0.04 1.41 0.02 
jjjj 3.0 - 10% 2.0 - 10% 1.1- 10° 0.370 - 10° 
TITY 7.2- 108 1.12- 108 3.0 - 108 0.20 - 108 
To illustrate this idea further, consider the case of ZZ production. In the DPS 
contribution, there will usually be two pairs of leptons, whose transverse momenta 


compensate each other, as in the equation above. In contrast, in the direct contribu- 
tion, the two Z bosons themselves will recoil against each other, and their individual 
transverse momentum usually will be larger than in the single-Z case, therefore leading 
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Fig. 7.14 Sketch of hadron y (left) and pı (right) spectra in e~e* anni- 
hilations. 


to the transverse momentum of the lepton pair being different from zero, 


HSD, (7.46) 
and the individual transverse momenta not compensating each other. Cuts like these, 
looking for recoiling pairs, etc., of course further enhance the DPS contribution over 
the direct component and allow for a better signal (DPS) to background (direct) rate. 


7.3 Hadronization 


7.3.1 Some qualitative statements 
7.3.1.1 Motivation: Jet masses from hadronization 


To understand and estimate the visible impact of hadronization, consider the case 
of jet production at a lepton collider. From LEP data it is known that the rapidity 
spectrum of hadrons is more or less flat, when rapidity is defined with respect to the 
main event axis, usually assumed to be the direction of the qq pair at leading order. 
At rapidities of the order of log mpaa/E hadrons cannot be produced anymore, due to 
energy conservation effects, resulting in the hadron spectrum vanishing fast. At the 
same time, the transverse momentum spectrum of hadrons with respect to the same 
axis roughly follows a Gaussian profile, see Fig. 7.14. Defining an expectation value of 
this profile, 


co co 2 
1 
(o) = J vipien) = Jopi exp (-2) oa ea 1GeV_ (7.47) 
0 0 


to be approximately given by typical hadronic radii of about 1 fm or Agcp & 250 MeV, 
the energy and momentum of a jet is given by 
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E = i dy cosh y / dpipip(p.) = (p) sinh Y 


Co 


aysinhy f dpspip(p.) = (p) (cosh ~ 1). (7.48) 
0 


P= 


eee 


The mass M of the jet therefore is given by 
M = F?’ — P? = 2coshY(p)? = 2E(p), (7.49) 


something of the order of about 2-3 GeV for a 100 GeV parton jet. This is an indication 
that in order to fully understand jet physics at the LHC, on the level of about 10%, it 
is important to also study non-perturbative effects such as hadronization. 


7.3.1.2 Single particle fragmentation 


A simple approach to describe hadronization is by considering the production of one 
specific hadron in a given process; consider, for example, e~e* annihilations into one 
specific hadron h and an otherwise arbitrary final state, e~e* —> h+ X. In order for 
this to happen, pictorially speaking, one of the quarks or gluons must turn into such a 
hadron plus other hadronic matter, the latter in order to compensate for colour degrees 
of freedom. This is because a single parton is a colourful object, while a hadron is not, 
so some colour neutralization must take place. Schematically, and assuming a quark 
q to fragment, therefore a process like q —> h + X must be described. Transitions 
like these could be called single-particle fragmentation, and they are usually described 
by fragmentation functions Dj q(z, pF), which, at leading order, parameterise the 
probability that at a scale up a hadron h emerges from a quark q, carrying a light-cone 
fraction z of its momentum. For the example process this implies that at leading order 
the cross-section for e~et — h+ X is given by 


doe-e+—>h+X (z) 


ay = Oe-et+ qq Dayal uF) + Drjq(Z, ur). (7.50) 


Similar to the PDFs fragmentation functions fulfil certain sum rules; for instance, 
as every parton q must eventually hadronize into at least one hadron, 


5 J dzDyjn(z, ur) = 1. (7.51) 
h 0 


To push the analogy a bit further, the dependence of the fragmentation functions on 
the factorization scale ur is logarithmic and given by evolution equations similar to 
the ones already encountered for the PDFs, see below. However, ignoring this scaling 
behaviour and assuming that Dg/n(z, wr) = Dg/n(z) implies that the probability to 
find a 10 GeV hadron h in a jet emerging from a 20 GeV parton q is identical to 
finding a 20 GeV hadron h of the same kind in a jet emerging from a 40 GeV parton 
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q. This in turn supports the notion of universality of the hadronization process once 
it is described at the same factorization (or hadronization) scale. 


7.3.1.3 Local parton—hadron duality 


One idea that is somewhat related to this behaviour is that observables at the hadron 
level, such as momentum and energy flows and, to a lesser extent, flavour quantum 
numbers, roughly follow their distribution at the parton level. This concept is known as 
local parton—hadron duality (LPHD) and was introduced in [177]. This idea has 
already been encountered in Section 5.1.1 and has been employed there to understand 
the origin of the hump-backed plateau. 

Related to this concept is the preconfinement property of parton showers. It 
implies that at every scale Qo, the colour structure of the parton shower allows the 
decomposition into colour singlets with an asymptotically invariant mass distribu- 
tion [151, 749], which depends on Qo and a hadronic scale A only, but not on the hard 
scale Q of the process initiating the parton showers. In fact, if Qo is much larger than 
typical hadronic scales, Qo >> A, the mass distribution of colour-singlets can be calcu- 
lated perturbatively as well as their momentum and multiplicity distributions, which, 
however, depend on Q: the net result is that the mass distribution exhibits a strong, 
power-like suppression at large masses and the asymptotic multiplicity distribution is 
given by a universal function of n/(n), a property known as KNO scaling [688]. 

The best way to see how this works is to use the large-N, limit for the parton 
shower. In this limit, the emission pattern of the shower can be arranged as a planar 
graph, with neighbouring partons carrying colour—anti-colour quantum numbers. This 
idea is at the hard of cluster fragmentation models, see Section 7.3.4. 


7.3.1.4 String effect 


A somewhat complementary picture is based on the very property of QCD itself, which 
is characterized by the gluons carrying colour charges in contrast to the Abelian theory 
of QED, where the photons are charge-neutral. As a consequence, the Coulomb 1/r 
form familiar from QED is not realized for QCD, although the potential obtained from 
the one-gluon exchange approximation actually is of this form. Essentially, this implies 
that the form of the potential cannot be obtained from naive perturbative expansions. 
Based on purely gluonic dynamics, at large distances the QCD potential between 
a static quark—anti-quark pair increases linearly and thus fulfils the confinement 
criterion put forward by Wilson [894]. Such a linear form of the potential is supported 
by observations, exemplified by fits to the masses of heavy quarkonia.? It is interesting 
to note in passing that the linear part of the potential can be related to the linear 
increase of the asymptotic Regge slope parameter a’, cf. Eq. (7.10). 

This leads to an interesting effect, which can be best described as the QCD ana- 
logue of the Nielsen—Olesen string [785] between two magnetic monopoles in a 


3For example, the relatively simple but hugely successful Cornell potential [496] reads 
V(r) = -2 + or, (7.52) 
r 


with the Coulomb parameter « and the string tension ø fitted to quarkonia masses. 
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Fig. 7.15 Field lines in electromagnetic (left) and strong (right) inter- 
actions, spanned by two poles, like static electron—positron and quark—- 
anti-quark pairs. While in electrodynamics the field lines occupy all space, 
with a strength decreasing with the distance from the poles, in strong in- 
teractions they “bunch up” in a flux tube with finite diameter between the 
two poles. 


superconductor. In contrast to electrodynamics, where the field lines of the electric 
field between two monopoles extend to infinity, in QCD the charged gluon field pulls 
itself together into a one-dimensional string between the two poles, as illustrated in 
Fig. 7.15. 

This effect of forming strings between colour charges is the non-perturbative con- 
tinuation of the drag effect discussed in Section 5.1.1. As a result, hadron production 
during hadronization closely follows the topology defined by the strings defined by the 
colour connections: as a result, hadrons in Mercedes-star ggg and qĝy events in e~ et 
collisions are produced, for the former, along the string spanned by the colour-ordered 
line ggg and not between the quark and the anti-quark, or, in the latter case, they are 
produced between the quark and the anti-quark, but away from the photon. 

This very idea of hadron production along colour-lines is essentially non-local: in 
the string picture, hadrons are also produced in phase space regions, where no parton 
acts as a seed. This, however, is mainly true for relatively soft hadrons. The string 
picture serves as the starting point for a simple model of hadrons and for a very 
successful model of hadronization, as discussed below. 


7.3.1.5 Dealing with baryons 


Before going into any more detail concerning various fragmentation models, it is worth 
to briefly discuss the hadronic degrees of freedom that will be produced. While usually 
fragmentation functions describe only the inclusive production of a specfic hadron from 
a given parton, the more involved models encoded in event generators turn all partons 
into an ensemble of hadrons. This means that some underlying idea must be developed 
of what actually constitutes such a hadron. The answer to this, in the framework of 
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the models that will be discussed below, is relatively simple: hadrons are bound states 
of quarks and antiquarks. 

In the case of mesons, this is fairly obvious and straightforward to realize. They are 
made of quark—anti-quark pairs such that one can write Ym = |q1q2) for their flavour 
part, and the only non-trivial issue is related to the flavour wave functions of neutral 
states such as 7 or 7’ which have some mixing in flavour and are thus a superposition 
of |uū), |dd), and |ss). Practically speaking, since the parton shower implicitly also 
acts in the limit of infinitely many colours, Ne — oo, every colour degree of freedom 
will have one and only one partner with the corresponding anti-colour. In a world 
without gluons, the colours would be carried by quarks and the anti-colour would be 
carried by anti-quarks, and there would be unique pairings of both. Any of these pairs 
would then carry the flavour quantum numbers of mesons. 

In contrast, in the large-Ne limit, the composition of baryons is not entirely straight- 
forward. As baryons consist of three constituent quarks, for Ne = 3 their colour indices 
are completely anti-symmetric, realized by a Levi-Civita tensor in colour space. This 
is how they form an overall colour-singlet. For Ne — oo such a simple reasoning of 
course would not work any more. Instead, usually the models underlying hadroniza- 
tion resort to the notion of diquarks, hypothetical bound states of two quarks or two 
anti-quarks. For large Ne, the colour state of two such combined quarks, a sextet, can 
be re-interpreted as an anti-triplet, which allows the formation of binary bound states 
of a quark and a diquark, like a meson. The flavour part of suitable baryon wave-- 
functions, expressed as quark—diquark bound states has been discussed, for example, 
in [727]. In the hadronization models implemented so far, the diquarks come as spin-0 
and spin-1 particles, which is what can be expected from an s-wave system made from 
two spin-1/2 objects. In order to maintain Fermi statistics, though, spin-0 diquarks 
can only exist for two different quark flavours. 

Diquarks are non-perturbative objects which act as some mnemonic device to pro- 
duce, carry and trace baryon number. Consequently, they are not thought to be pro- 
duced in the perturbative phases of event generation, in particular the parton shower, 
but rather in those phases that are characterized by scales around typical hadron mass 
scales. This in turn means that usually only diquarks consisting of two light quarks, 
u, d, or s, are being produced, which have constituent masses up to the order of 1 
GeV. The absence of diquarks containing one or two heavy quarks, c or b, implies 
that in typical simulation programs doubly or triply heavy baryons are absent. On the 
other hand, this restriction also implies that ordinary heavy baryons, such as Ae or 
Ay, consist of a heavy quark and a light diquark, a picture that is qualitatively well 
aligned with similar picture in heavy quark effective theory in which heavy hadrons 
consist of a heavy quark and some “brown muck” around them. 

Various other ways to generate and trace baryon quantum numbers have been 
suggested, for instance by identifying Y-shaped string junctions where a “baryon” 
centre plays the role of a Levi-Civita tensor and is connected to three quarks through 
strings [854], but they will not be discussed further here. 
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7.3.1.6 The limitations of phenomenological hadronization models 


First of all, and most obviously, phenomenological fragmentation models, like the 
Feynman-Field model, the string or the cluster fragmentation models, do not respect 
quantum mechanical principles by constructing amplitudes and adding them before 
squaring. Instead they model and employ probability distributions right away through 
a suitable ansatz. This of course means that quantum effects, like for instance correla- 
tions due to the bosonic or fermionic nature of the hadrons, cannot be accounted for 
in a coherent fashion. 

In addition, all models to-date are more or less entirely driven by a limited set 
of qualitative construction paradigms but lack a more quantitative insight. As a con- 
sequence they rely, probably overly so, on the introduction of an increasing number 
of parameters to fit the data. This fitting to data, often called tuning, is exclusively 
performed based on LEP data from e~e* annihilations at Eems = 91.2 GeV, due to 
their high quality and the very large statistics. The resulting model parametrizations 
are then usually used verbatim not only for other c.m.—energies, but also for different 
processes, in particular for pp and pp collisions at the TEVATRON and the LHC exper- 
iments. While this faith probably is a consequence of the lacking ability to perform 
detailled checks of the assumption, e~ e+ annihilations and hadronic collisions are dif- 
ferent enough to raise some questions. While the former will always involve a colour 
dipole spanned by a qq pair plus subsequent emissions, the latter have a much more 
complicated colour structure and are often driven by gluons. Furthermore, the latter 
do have energetic additional sources of colour — the beam remnants. The possibility 
of the underlying event ultimately results in very complicated colour connections going 
back and forth between the beam remnants, including the other final-state particles, 
and possibly overlapping. This can and probably will mean that average distances be- 
tween partons show larger fluctuations in hadronic collisions compared to the ones in 
ee? annihilations, which are solely driven by the self-similar structure of the parton 
shower. 


7.3.2 Fragmentation functions 
7.3.2.1 Definition and evolution equation 


The most straightforward way to define fragmentation functions is by considering 
the cross-section for the production of a hadron h in e~e* annihilations to hadrons, 
proceeding through a Z boson or a virtual photon for the process e~et > y*/Z > 
h + X. Introducing, in analogy to Eq. (5.317), the energy fraction 


2E 
c= € 0) (7.53) 

V/s 
of the hadron with respect to the c.m.-energy, and its angle 0 with respect to the 
electron beam axis, allows to write the differential hadron production cross-section as 


1 da! 3(1 + cos? 6) p in? 
= F. i ek Soe F . 
Oete--+qq ded cos 0 8 p(x) + 4 pz) + 4 "4 (2) 
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Here, the dependence of the fragmentation functions on the fragmentation scale py 
has been suppressed. The subscripts T and L refer to the transverse and longitudinal 
fragmentation functions representing the contributions from the corresponding boson 
polarizations, while A marks the parity-violating term from the interference of vec- 
tor and axial-vector parts, yielding an asymmetric fragmentation. Integrating over 0 
yields the total fragmentation function F” = F} +F h which can be related to the 
parton fragmentation functions (often called fragmentation densities or just 
fragmentation functions) Daj; through 


1 
1 do? dz s £ 
i —— = F(z, p*) = [So ss => } Dai 7.55 
Tete->q] dz (x, H ) > z Z, Qs, u? h/ A na 3 ( ) 


where the coefficients are 


Ci (« Qs, =) = gils) (1 — z) + O (as). (7.56) 


In this expression, the g;(s) signify the couplings of the quarks to the y*/Z, and 
are given by linear combinations of their electrical, vector, and axial charges and the 
corresponding boson propagator terms. At leading order, the Cy, for quarks and anti- 
quarks are identical, and the contribution for the gluon C, only emerges at O (as), 
cf. [783]. Written in this form the analogy of the fragmentation functions F' with the 
structure functions fF 2 in DIS and of the parton densities f;/; with the fragmentation 
densities D} j; becomes fairly striking. 

The fragmentation functions (FFs) Dp/;(z, p’) encode, at leading order, the prob- 
ability that a hadron A can be found in the hadrons stemming from the fragmentation 
of parton i, carrying a light-cone momentum fraction z of the parton. This is in com- 
plete analogy with the parton distribution functions (PDFs). In full analogy to the 
PDFs again, the FFs experience logarithmic scaling violations. They depend on the 
fragmentation scale u? as expressed through the evolution equation through 


IDjji(x, u’) dz p 
E o -5j Pji (z, as) Drjj (Fu ) 5 (7.57) 


in parallel to the DGLAP evolution equations for the PDFs encountered, for instance, 
in Eq. (2.31). 


7.3.2.2 Parameterizations 


Similar to the PDFs, discussed in Chapter 6 and in particular in Section 6.2.2, the 
evolution equations in Eq. (7.57) are used to connect data with the form of the FFs 
at some fixed lower scale. This lower scale jo typically is of the order of a few Agcp 
for light flavours like u, d, or s quarks, or gluons and of the order of the heavy quark 
mass in the case of c or b quarks. A typical ansatz, especially for light hadrons and 
flavours for the FFs at this scale is of the form [279, 624, 687, 697] 
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h h 
Dnjilz, we) = Nix (1—2)**, (7.58) 


but also slightly more complicated forms have been proposed, see [447, 448]. This is 
especially true for heavy quark fragmentation. 

Typically, a number of symmetries are assumed, for instance a symmetry between 
particles and anti-particles 


Dy, hil Ho) = Dy, njīl2 Ho). (7.59) 


Similarly, for instance for the case of pions, the suppression of “sea-quark” formation 
of pions out of the “wrong” quark type is often assumed. Therefore, inequalities like 
the following ones are expected to hold 


Dr+jalz, ua) = Dr+/s < Drea Lo) im Dr+jā 


2 2 (7.60) 
Dr+jalz, Ho) = Dr+jaa < Dx+/ul% uo) < Dxt+/s- 


Here the latter inequality in the second equation reflects the suppression of secondary 
s3 production formation in the fragmentation process: in order to produce a K* out 
of a u quark, the strangeness quantum number of a § quark must be produced through 
non-perturbative ss formation. Due to the larger mass of strange quarks, this, however, 
is more unlikely than the corresponding non-perturbative uu formation in the case of 
a leading 5 quark fragmenting into a K+. This reasoning will be recovered under the 
keyword of strangeness suppression later in this chapter. Further assuming “valence 
enhancement” of the FFs, the idea that at large z the quantum numbers of the quark 
flavour components would dominate the fragmentation process motivates assumptions 
like [697] 


+ + + 
mt ant _ ar 
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Be’ = 6a = BK +1 = BS +2. 


(7.61) 


Of course, another constraint on the fitting procedure of such parameterizations is 
the momentum sum rule, 


fo P | = 1, (7.62) 
0 h 


which states that all hadrons stemming from the fragmentation of a single parton 
should carry its total energy. 

Actual fits to FFs at LO and NLO, similar to the ones for PDFs, have been 
performed by different groups. In Fig. 7.16 results for one set of these fits for charged 
pions [446] and protons [447] at NLO, at the low scale of up = 1 GeV are displayed; 
the fitting function the authors use is given by 
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Fig. 7.16 Fragmentation functions for charged pions (left) and protons 
(right) at next-to-leading order and wr = 1 GeV, with parameterizations 
taken from [446, 447]. 


Nz*(1— 2)? [1+ 7(1— 2)°] 
B2+a,1+8)+yB2+a,1+8+6)’ 


Dnjg(2, ur = 1GeV) = (7.63) 


where B(a, b) denotes the Euler 8 function. 


7.3.2.3 Heavy quark fragmentation 


For the fragmentation of heavy quarks, different parameterizations of the fragmenta- 
tion function at low scales have been proposed, among them 


1 1 ¢« \? 
1 TE (Peterson et al. [804]) 
zZ zZ = 
Dyjolz, mo) x < (1-2) (Kartvelishvili et al. [653]) 


2 
bmi, 


) (Bowler [281]) 


z 

(7.64) 
but also the form in Eq. (7.58) has been suggested, for example by [407], or more 
complicated forms like the one by [411]. 


7.3.3  Feynman—Field model 
7.3.3.1 Description of the model 


The Feynman-—Field model of fragmentation [527], also known as independent 
fragmentation, goes a step further than the fragmentation functions introduced in 
the previous section, Section 7.3.2, in its description of the hadronization process. 
While the fragmentation functions are typically concerned with the transition of one 
parton into one hadron, they are not adequate to describe the production of the full 
hadron ensemble in a process involving partons in the final state, due to simple colour- 
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and flavour-conservation arguments: turning a colourful parton into a colourless hadron 
without any kind of compensation will violate colour conservation. At the same time, 
after a hadron has been produced from a given quark, its flavour is absorbed in the 
hadron. Since the strong interaction does conserve flavour, this means that some other 
flavour quantum number may be left over. As an example, consider the case, where 
a u-quark fragments into a 7+. With the flavour quantum numbers of the r+ being 
ud, the next hadron must take care of the d-quark that would guarantee that the 
produced and absorbed d is compensated for. Assigning the residual colour to this d 
quark would actually also account for this quantum number. In a nutshell, this is the 
reasoning behind the Feynman-—Field model. 

To be more specific, consider the case of et e~ annihilation into a qq pair, without 
the emission of gluons. The two quarks move away from each other, in their rest frame 
exactly back-to-back, and for simplicity their momenta can be oriented along the z 
axis. As a consequence of their relative motion, an increasingly strong colour field 
along the z direction evolves, in which secondary flavours can be produced in pairs 
g2q2. Now, one of the primary quarks, say qı will combine with the secondary g@2 to 
form a qig2 meson, and the process repeats itself with the remaining q2đ1. Assuming 
that the “primary” qig2 meson carries a momentum fraction € of pọ away (ignoring 
masses), the momentum of q2 is given by (1— £) pq and therefore the invariant mass of 
the remaining g2—q, system is reduced by the same factor. This process of “inserting” 
further pairs qiq; can continue, by forming higher-rank mesons q,q;, taking into account 
the reduced invariant mass of the leftover qq pair. In the Feynman-—Field model this 
process is completely recursive. For instance, the probability density in the momentum 
fraction €, or the probability that a specific quark transforms into a specific meson is 
the same for each meson production step. However, as the recursion progresses, and 
the invariant mass of the remaining pair becomes smaller, at some point the latter 
reaches typical hadronic mass scales. At this point, there are two viable choices in 
the Feynman-Field model: either the production of a meson is forced or this last pair 
is simply dropped, in both cases with momenta being adjusted by some reshuffling 
to ensure every meson is on a reasonable mass shell and total four-momentum is 
conserved. 

The produced mesons will have some transverse momentum with respect to the jet 
axis, the original quark momentum. In the Feynman-—Field model this is realised by 
assigning mutually compensating transverse momenta to the constituents of the sec- 
ondary qg-pairs with respect to the original quark—anti-quark axis (the z-direction). 
The transverse momenta of mesons is then obtained from the sum of the transverse mo- 
menta of their constituents. The differential distribution in these transverse momenta 
for the quarks is given by a Gauss-form, 


2 2 Tki 
7.3.3.2 Shortcomings 


There are a number of short-comings in the original version of this model, the most 
obvious one being the notable absence of any gluons in its original version. There, 
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hadronization proceeds fully recursively and through quark degrees of freedom only, 
giving rise merely to mesons, but not to baryons. The introduction of diquarks would 
remedy the latter point, without any real addition to the model, but the former point 
is a bit more tricky and was never really fully addressed since more involved models 
such as string or cluster fragmentation took over, see the next sections. These improved 
models did not suffer from some of the more fundamental issues the Feynman—Field 
model had by construction. 

There are also some issues with the way the splitting process progresses. Due to 
the form of the probability density p(n) for the momentum fraction 7 = 1 — € of 
the residual quark chosen in the original model, the model is sensitive to whether the 
production of mesons occurs from the q or the g end.* 

Finally, note that the way it has been formulated, the model is not Lorentz-invariant 
and therefore the result has some dependence on the actual frame the hadronization 
is performed in. 


7.3.4 Cluster fragmentation 
7.3.4.1 Underlying idea 


Cluster models, which are mainly based on the idea of preconfinement, have been 
discussed very early [899] and have been implemented in the framework of an event 
generator for e~e* annihilations into hadrons in [528, 533]. They are the model of 
choice for the simulation of hadronization in the event generators HERWIG, HERWIG++ 
and SHERPA. 

The key idea in this class of models was to enforce non-perturbative splittings of all 
gluons into quark—anti-quark pairs at the end of the parton shower. As a consequence, 
since the model is formulated in the Ne — oo limit, colour-singlet clusters are formed, 
which consist of predominantly neighbouring quark—anti-quark pairs. The clusters will 
have the flavour quantum numbers given by the quark—anti-quark pairs, of mesons or, 
if they consist of a quark—diquark pair, of baryons. Once formed their masses are 
distributed in a continuous spectrum. The typical mass of these objects is relatively 
low and driven by the infrared cut-off Qo of the parton shower and by the masses of 
their constituents. While the bulk of low-mass clusters will be reinterpreted as actual 
hadrons, and enter hadron decays after some momentum reshuffling to force them onto 
their mass-shell, the high-mass tail of the distribution is seen as a washed out spectrum 
of excited hadrons. These clusters will decay into further, lighter clusters until they 
reach the mass scale of hadrons. This leads to a distribution of primordial hadrons 
that closely follows the pattern resulting from the parton shower, a manifestation of 
local parton—-hadron duality (LPHD). 


4In the original conception of the model in [527] it was noted that the energy distribution of 
primary mesons hints at p(7) peaking close to 1, the distribution in the meson momentum fraction 
€ must be peaked at low € to agree with data. However, the inclusion of gluon emissions certainly 
improves the situation, but of course it would mean that a way to deal with gluons inside this model 
must be defined. 
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Fig. 7.17 Distribution of primary cluster masses excluding their con- 
stituent masses, Motus = 4/mi.,, — M? — m2 for clusters with constituents 
1 and 2, in e~e* annihilations at various Eems (left), and the multiplicity 
distribution for primary clusters (right). The exemplary results shown here 
have been obtained from SHERPA. 


7.3.4.2 Cluster formation 


In a first step in cluster fragmentation, the gluons must decay into quark—anti-quark 
pairs or, if the phase space allows it, into the usually somewhat heavier diquark pairs. 
There are two ways this is achieved in practical implementations. Either, as encoded in 
the HERWIG and HERWIG++ realizations of the cluster fragmentation idea, the gluons 
acquire a non-perturbative mass, which allows a two-body decay in the rest frame 
of the now-massive gluon. This gluon mass (m,) then becomes a parameter of the 
fragmentation model that has a direct impact on what flavours are actually produced 
in the transition from the perturbative to the non-perturbative regime. Usually, mg ~ 
1GeV. With quark constituent masses around Agcp, Mua © 350MeV and m, & 
450 MeV, the gluon decays happen just above the threshold and, consequently, there 
is not much phase space left for the quarks. As a result, isotropic decays of the gluons 
will lead to the produced pairs to follow the gluon direction to good approximation, 
thus encoding LPHD. Alternatively, in SHERPA, the gluons are kept massless and decay 
by borrowing some four-momentum from a colour-connected spectator parton. 

For both HERWIG++ and SHERPA, the availability or lack of phase space for the 
gluon decay dictates the available flavours and influences the actual flavour of the quark 
or diquark pair into which the gluons decay. While in HERWIG++, only light quark pairs 
are accessible, due to the relatively low non—perturbative gluon mass, in SHERPA all 
flavours can eventually be produced. To exclude the soft, non-perturbative production 
of heavy flavours, c and b quarks or diquarks containing them, they are explicitly 
disallowed. In addition, “popping” probabilities P_,;7 modify the relative abundance 
of the permitted flavours f in the non-perturbatively enhanced gluon splitting process. 
These quantities will reoccur when the decays of clusters will be discussed. They enter 
the HERWIG+-+ as well as the SHERPA cluster fragmentation model. 

After the gluons have been decayed, primary clusters are formed from the unique 
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colour—anti-colour pairs, by combining the corresponding quarks or diquarks, which 
thereby also determine the cluster masses and momenta. 

In the left panel of Fig. 7.17, the mass distribution of primary clusters produced 
in e~e* annihilations is exhibited, peaking strongly at masses of around Melus œ% 1 
GeV, regardless of the c.m.-energy of the events. The near-identical shape of the mass 
distributions is a result of the independence from the hard scale of the process of the 
parton configuration at the end of the parton shower. It is interesting to note though 
that, trivially, mass thresholds related to the different parton species become visible, 
most prominently those of clusters carrying charm or bottom quantum numbers. They 
vanish more or less completely though when considering cluster masses with the con- 
stituent masses subtracted. In the right panel of Fig. 7.17, the cluster multiplicity 
distribution are shown for different c.m.-energies. The mean cluster multiplicity (n) 
increases with energy, but the normalized distribution n/(n) approaches a universal 
function for large energies. 


7.3.4.3 Cluster decays to hadrons 


Once primary clusters have been formed, many of them have masses close to the mass 
of primary hadrons with identical flavour quantum numbers. This is motivation to 
directly identify them with such hadrons and to compensate for the mass difference 
between the cluster and the hadron by reshuffling momenta in the event accordingly. 
This identification depends on the primary hadrons included in the simulation, with 
the heaviest ones introducing a scale for such transitions to be allowed. 

Other clusters may be slightly too heavy for such a transition, or there is no 
primary hadron with a sufficiently similar mass. In cluster fragmentation models, such 
relatively light clusters will decay into pairs of hadrons, C —> h hz. The mechanism, 
simply put, is that a flavour pair ff (quark—anti-quark or anti-diquark—diquark) is 
non-perturbatively produced to form new flavour—anti-flavour pairs together with the 
original cluster constituents Fi Po, Fy f and fP, which form the flavour part of the 
hadronic wave functions (or parts thereof, in case the flavour wave functions have 
more then one component, like neutral pions). The probablity of picking a hadron pair 
is given by considering the available spin degrees of freedom of the hadrons, ns, the 
phase space for the binary decay, the overlap of the flavour configurations F} f and 
fF with the hadron flavour wave functions, |(F(f)|¥)|?, and the “popping” rate for 
the flavour f, P_, ¢;, 


Dap? aa 3,2 
hı) p (he) ricco my m3) + 4mjm5 
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This is, possibly, further modified by multiplet weights reflecting the fact that for 
given flavour configurations there may be different viable hadrons belonging to dif- 
ferent multiplets which are defined by their spin and excitation quantum numbers. 
Correspondingly there will be relative weights for the pseudoscalar, vector, etc.. me- 
son multiplets. This effectively leads to an altered production rate of hadrons from 
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heavier multiplets with respect to the lighter ones. This becomes necessary because 
the large available phase space for lighter hadrons is usually not correctly compensated 
by the increasing number of polarizations for heavier hadron species. 

Having selected the hadrons, the only missing input for the cluster decay is their 
orientation. In the original version of cluster fragmentation models, the cluster decays 
have been isotropic, but this does not correctly cover leading-particle effects, in the 
xp distribution of hadrons. As a consequence, also highly non-isotropic decays are 
used now, especially for those clusters where one of the constituents stems from the 
perturbative phase of the event simulation and could thus be identified as a leading 
parton. Naively, this means that the decay contributions must peak for vanishing 
transverse momentum of the outgoing hadron with respect to the cluster axis, typically 
with a form like 1/k” or a Gaussian. This introduces a parameter determining the 
strength of the peak, in addition to some parameter that defines the scale up to which 
C — hh decays occur. 


7.3.4.4 Fission of heavy clusters 


Finally, there are those clusters which are too heavy to either transit into a single 
primary hadron or decay into two. These clusters will also decay, but into final states 
consisting of one or two new clusters, such as C —> C Cg or C > C'h. Again, a 
new flavour pair ff needs to be created and kinematically distributed, with different 
algorithms. 

In the HERWIG and HERWIG++ versions of the cluster model, the masses of the two 
new clusters are selected according to 


Mig = m2+(M—miz—m;)#", (7.67) 


where M is the mass of the original cluster, # is an isotropic random number, # € 
[0, 1], and P is a parameter which can be chosen differently for clusters contaibing only 
light flavours, bottom, or charm. For clusters containing a beam remnant particle, the 
mass of the new cluster with the beam remnant parton — or possibly the masses of 
both clusters — are chosen with an exponential distribution. In any case, a hard veto 
ensures that the chosen cluster masses are larger than the sum of their constituent 
masses and smaller than the original cluster. The new clusters are distributed either 
isotropically or with preferred directions, following the original hard partons from the 
parton shower. 

In contrast, in SHERPA the four-momenta of all four constituents are fixed by first 
choosing the flavour of the new pair and their transverse momenta, which directly 
translates into the transverse momentum of the new clusters. This choice is made 
according to the same Gaussian distribution as in the case of cluster decays to hadrons. 
Then longitudinal momenta fractions of the new clusters with respect to the two 
original constituents are chosen, with a function of the form 


f(z) = 271-2)8. (7.68) 


Together with the transverse momentum this fixes the masses of the new clusters 
which can then be formed by merely arranging the four constituents. 
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However, for the new clusters, again decisions of how they decay further must be 
made, along the lines just described. 


7.3.4.5 Further thoughts 


There are some interesting features of cluster fragmentation models that could pave 
the way for further insights into the dynamics of the hadronization process. 

First of all, they thrive on the non-perturbative splitting of the gluons, predomi- 
nantly produced during the parton shower, into quark or diquark pairs, which usually 
have an electric charge. In addition, a good fraction of the clusters produced and decay- 
ing is also charged. There are therefore various phases in cluster fragmentation where 
charged particles are produced or decay, processes which typically trigger QED radi- 
ation. While such effects have not been implemented in cluster fragmentation models 
to-date, one may speculate in how far they could account for the puzzle of the analy- 
ses [104, 105] by DELPHI, which find a significantly larger number of soft photons in 
ee? annihilations to hadrons than expected. 

Another interesting aspect of hadronization studies at LEP experiments concerned 
the level of flavour-correlations in hadron production, exemplified by OPAL’s mea- 
surement of the AA correlations in e~e+ to hadrons at the Z pole [101]. Usually 
the diquarks responsible for the creation of baryons are produced in late stages of 
hadronization, when clusters or strings decay and produce hadrons. This yields a fairly 
strong correlation in phase space between the diquark pairs, and, correspondingly, also 
between the baryons. One way of softening these correlations is to also allow diquarks 
to be produced in earlier stages of the fragmentation, in the non-perturbative decay 
of the gluons during cluster formation. 


7.3.5 String fragmentation 
7.3.5.1 Hadrons as strings 


In contrast to cluster models, which employ some intermediate stage of clusters be- 
tween the perturbative phase of hadron production and the primary hadrons, string 
models such as the Artru-Mennessier [172] model or the more sophisticated Lund 
model [162, 163, 849] are based on a direct translation of partons into hadrons. The 
underlying idea is relatively simple: the self-interacting low-energy confining modes 
of the gluon fields can be thought of as a flux tube developing between two oppo- 
sitely charged poles. At large distances the confining QCD potential assumes a form 
where it increases linearly with distances, V(r) ~ —or, where the string tension 
o ~ 1GeV/fm ~ 1GeV?. 

In this model, hadrons can be understood as strings spanned between massless 
quarks [162]; subjecting a classical massless quark-anti-quark pair to such a potential 
will result in a constant attractive force between them and an oscillatory movement. 
To see this, assume the two quarks are at rest and away from a common origin by a 
distance ro. The overall system will then have a total energy given by its potential 
energy only, Erot = V(2ro) = 2Eo = 2øro, which could also be interpreted as the 
mass of the system, m = 2Eọ. Assuming the quarks need no time to accelarate, they 
will move towards each other, with the speed of light, and on two light-cones t z, if 
they have been oriented along the z axis at the start. When passing the origin at time 
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t = ro = Eo/o, each will have energy and momentum Epo and, therefore, light-cone 
momenta of 2K). After another time ro they will have swapped positions and again 
be at a distance of 2r9. This “yo-yo” motion will repeat itself and after a total time 
T = Aro the quarks are back to where they started. The total area A covered by the 
string during that time is given by eight right-angled triangles with legs of length ro, 
thus 


2 
ee (7.69) 


A simple calculation shows that the area is Lorentz-invariant, for details cf. [162]. This 
motivates to identify bound states — hadrons, or, more precisely, mesons — of mass m 
with such configurations of (massless) qq pairs connected through a linear force-field 
and bouncing back and forth with the speed of light. 


7.3.5.2 Detour: An electromagnetic analogy 


Following the logic of [162], it is interesting to note the close analogy to electromagnetic 
superconductors. In order to work this out, remember two of many of their interesting 
properties. 

First of all, Cooper observed [412] that the exchange of phonons, lattice oscillations, 
induces a small attractive force between pairs of electrons close to the Fermi surface 
of a crystal. This leads to very loosely bound states, aptly named Cooper pairs, of 
size €, which often is macroscopic. This means that in such a case the Cooper pairs 
extend over many lattice spacings, and since they are bosons, they can occupy the 
same space’ and “condense” into the same ground state. However, if there are enough 
of such pairs, they form a charged Bose gas. The formation of the pairs opens a gap in 
the continuous spectrum of the electron gas that normally exists in metals, translating 
into a minimal energy to excite the electrons out of such states. As a consequence, the 
small excitations that characterize electron scattering and electric resistance become 
forbidden; the Cooper pairs can move freely in the crystal, which in turn becomes 
superconducting. This sketches the BCS theory [201, 202] of superconductivity. 

Second, applying a magnetic field to such a configuration will create a current 
of Cooper pairs, which in turn will counteract the original field and “expel” it from 
the metal. Any applied magnetic field will therefore have an exponentially falling 
penetration depth in a superconductor; this is the Meissner effect [758]. Increasing 
the temperature of the crystal will lead to the excitation of the Cooper states. When 
the temperature passes the critical temperature they will break up and not recombine 
any more, the crystal will become normally conducting again and the external magnetic 
field will start to permeate it. 

For a type I superconductor, € >> A. There will be an interfacial layer with a 
thickness of about À between the superconducting material and the external magnetic 
field where the latter one is practically non-existent, while the Cooper pairs, due 
to their finite and relatively large size, will be mostly found in the interior of the 
superconductor. To optimize the energy of such a system, the boundary region will thus 
assume a shape of minimal size with a thickness defined by a compromise between the 


5It should be noted, though, that they do not form stable bosons, as the electrons will enter and 
leave the energetically favoured Cooper pair situation. 
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quickly fading external magnetic field and the density of the Cooper pairs. Conversely, 
for a type II superconductor, € < A. The magnetic field will penetrate deeply into 
the superconductor and will have a large overlap with the region where the Cooper 
pairs are located. As a result, the volume and shape of the boundary region will be 
maximized. 


7.3.5.3 String breakup: Producing gg pairs 


Making the picture of hadrons consisting of gq pairs bound together by a string in 
a yo-yo mode more dynamical amounts to allowing them to have a larger energy, 
above typical hadronic scales. Then, the idea of identifying hadrons with strings can 
be turned into a fragmentation model. To see how this works, assume a primary pair of 
quarks is produced at the origin with a large c.m.-energy E. The quarks will move away 
from each other, back-to-back along, say, the z axis, and a string will be spanned by 
that movement. In the string hadronization model, hadrons are produced from such 
a configuration by multiple break-ups of the string through the production of new, 
secondary qq pairs in the strong field. These quarks, in turn, will form new endpoints 
of the smaller string fragments. 

The idea behind the emergence of such secondary pairs is that they are sponta- 
neously produced in one point, and then tunnel to have a separation large enough to 
be pulled apart by the strong field. The creation of n (ff) pairs in slowly varying 
static strong electric fields Æ proceeds through the Schwinger mechanism [841], 
which yields a production rate given by 


(eE)? Bl nm 
ere 5 -z exp a ik’ (7.70) 


Extending this to the case at hand, of a string break-up, is straightforward, since the 
string actually represents a linear potential, a constant field. As a consequence, the 
actual flavour of the qq pairs, whose production triggers the break-up, is determined 
by their mass, with relative probabilities given by 


am? 
Pag x exp | =-=] (7.71) 


with ø the string tension. Assuming quark masses as above (Mu, a = 350 MeV, m, = 
450 MeV, Mudy) = 700 MeV) and hadronic scales around Agcp for this parameter, 


o © Agcp/fm ~ 0.2 GeV”, leads to relative probabilties of 
Pua : Ps : Pua) ™ 1 : 0.3 : 0.003. (7.72) 


Allowing the produced pair to have a transverse momentum, and remembering the 
idea of a Gaussian distribution for the bulk of the produced hadrons, cf. Fig. 7.14 in 
Section 7.3.1, the expression above is replaced by 
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2 2 2 
Pag — Pog(pi) X exp (-2!) exp (=) = exp (=) . (7.73) 


This translates into a flavour—independent Gaussian distribution of transverse mo- 
menta, where the production of heavier flavours are exponentially suppressed such that 
charm and beauty are not being produced at all in the fragmentation. It is customary 
to fine-tune the distribution of flavours with additional suppression or enhancement 
factors for strange quarks, diquarks, and various flavour and spin configurations of the 
latter. 


7.3.5.4 String breakup dynamics 


Fig. 7.18 Break-up dynamics of a string: the string breaks at two points 
i and j, and the resulting quarks meet at point ij 


Consider the situation sketched in Fig. 7.18 where a quark q; from the break-up 
point 7 and an anti-quark q; from the break-up point j meet at a point ij. For simplicity 
only one spatial dimension x is taken into account, as this problem is practically 
(1 + 1)-dimensional. Since the massless quarks move with the speed of light in the 
linear potential V(x) = øx, their energies and momenta are given by 


Big = +o (£i j = Liz) and Pij = to(ti,; = ta) (7.74) 
and for the total system 
Ei; = Ei, +E; = o(a; — Tj) and Pij = Dit Dj = a(t; —t;). (7.75) 


The system is at rest, when its momentum is zero, or, in other words, if the two break- 
ups 7 and j happen at the same time t; = tj. In general the overall rapidity of the ij 
system is 


1, (ti 3) + (ti — ty) 
Yij = = log ; (7.76) 
7 2? (wi — 23) — (ti — ty) 
The requirement of a positive mass squared of the combined system, 
ms; = E? — Diy = o? (xi — rj) — (ti — t;)?] > 0, (7.77) 


translates into the requirement that the two vertices 7 and 7 are separated by a space- 
like distance and thus causally disconnected. This implies that all string break-ups 
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must be treated on an equal footing — there is just no temporal ordering like in 
the case of the cluster model, where clusters may decay into clusters. Every break- 
up partitions the original string into a left and a right-moving one (or, along the 
positive and negative light-cone axis). Taking into account only longitudinal degrees 
of freedom, the relation m? = ptp~ between the mass m and the light-cone momenta 
p% implies that choosing a mass for any string fragment leaves only one of the two 
light-cone momenta as free degree of freedom. This hints at a practical way of ordering 
the break-ups of the string into a sequence along one of the two light-cone axes, where 
hadrons of mass m are being formed. Labelling the end of the string where the break- 
up happens with a corresponding flavour, naively there are the following possibilities 
for a string break-up from one of its ends: 


string|qı] —> meson[q1 | + string[q2] 
string{qi] —> baryon[q (q2q3)] + string|(@9;)] 
string|(qig2)| —> baryon[(qig2)qs3] + string[gs] . 


The flavours and the transverse momenta of the hadrons are given by the Gaussian 
distribution in Eq. (7.73). The selection of the actual hadron is then further driven by 
multiplet-specific weights, corresponding to the same logic already encountered in the 
cluster fragmentation model. After this, the only quantity left to fix is the splitting 
parameter z of the longitudinal momenta between the hadron and the residual string. 
The overall result for this splitting, however, should be independent of which axis 
is chosen for the decays in the sequence. To implicitly guarantee boost invariance along 
the longitudinal axes, the corresponding light-cone momentum fraction z of the hadron 
is used. Demanding this independence leads to the Lund parametrization for the 
fragmentation function describing the process string — string + hadron: 


f(z) = yo exp (- om.) (7.78) 


zZ z 


where a and b are free parameters of the model and m is the mass of the hadron. 
Typically there are two sets of parameters a and b for strings with a quark and a 
diquark end, but this tends not to describe the production of heavy hadrons very 
well. In PYTHIA, therefore, more fragmentation functions, from Eq. (7.64), for heavy 
hadrons are also available. 

The splitting of strings into hadrons and smaller strings can be iterated until only 
a very light system, in the middle of the original string, is retained. For such light 
systems procedures similar to cluster decays into hadrons are usually employed, for 
more details cf. [853]. 


7.3.5.5 Kinky strings: Dealing with gluons 


Up to now, only strings emerging from qq pairs have been considered. When a gluon is 
inserted into the game, qgq, the colour flow is not a simple line connecting quark and 
anti-quark, but instead more complicated with q being connected to g, which in turn 
is also connected to qg. In such configurations, the gluon acts as a kink in the string, 
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carrying energy and momentum, and the simple yo-yo motion of qq pairs is replaced 
by a more intricate pattern. This picture implies that the gluon experiences twice the 
string force, since there are two strings attached to it, rather than the single strings 
attached to the quark. This also reflects the ration of Casimir operators of gluons and 
quarks, C4 and Cr with a ratio of C4/Cr = 9/4 ~ 2. As a consequence, hadron 
production is enhanced in the sectors of phase space spanned by the qg and the gq 
strings, while hadron production in the qq sector is massively suppressed. This is a 
direct consequence of the string picture, which can be viewed as an effective model of 
QCD at low energies in the limit of infinitely many colours, Ne —> oo. The resulting 
pattern of hadron production has already been anticipated in the discussion of the 
drag effect in Section 5.1.2. It is interesting to note here that for relatively soft gluons 
the kink becomes less and less prominent — the picture of gluons as kinks in a string 
therefore automatically provides a smooth connection with the parton shower. The 
actual treatment of these kinks in string fragmentation models is relatively subtle and 
the reader is referred to the original literature [849, 850], the PYTHIA manual [853], or 
the review [290]. 


7.3.5.6 Advanced features of baryon production 


The string fragmentation picture developed so far leads to a strong correlation of 
baryons in phase-space, since a diquark produced in a string break-up from which a 
baryon emerges leads to the corresponding anti-diquark as the new end point of the 
residual string. This strong correlation however is challenged by an OPAL measurement 
of AA correlations at LEP [101]. This re-inforced the idea of the so-called pop-corn 
mechanism [165], which relates the production of baryons to the popping of more 
than one qq pair. This proceeds in such a way that two quarks of two pairs conspire to 
form one diquark while two anti-quarks of two such pairs form an anti-diquark. This 
would effectively lead to configurations where one or two mesons are formed between 
two baryons, like BMB, BM MB, etc.., thereby decorrelating them. This picture is 
effectively realized by supplementing the potential string break-ups of Eq. (7.78) with 
yet another one, namely 


string[(qig2)] —> meson|q:g3] + string[(q2q3)] - 


7.4 Hadron decays 
7.4.1 General thoughts 


The overwhelming majority of all objects produced in particle collisions at a hadron 
collider like the LHC, that interact with a detector and therefore have a sufficiently 
long lifetime of cr > O (mm), are hadrons in the lowest-lying multiplets. These include 
charged pions and kaons, long-lived neutral kaons — the Kz — as well as nucleons, 
protons and neutrons. Most of them, and also most of the photons or leptons that 
accompany the hadrons, actually stem from the decays of hadron resonances or hadrons 
containing heavy quarks. It is therefore important to have some insight into how these 
heavy unstable objects decay and how these decays can be described theoretically. 
There are large numbers of known unstable hadrons, giving rise to a plethora of 
different decay modes and characteristics, which cannot be given justice by covering 
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them in any detail in a book like this — a quick glance at the PDG [229] is testament 
to the incredible amount of data and knowledge gathered up to today. So, instead of 
trying to have an exhaustive discussion, which would be beyond the scope of this book, 
a brief overview supplemented with references to a number of relevant review articles 
must suffice. The focus will be put on those particles and their decays that are phe- 
nomenologically most relevant. Typically these are the 7-leptons, hadrons containing 
open heavy flavour like B and D mesons, and quarkonia states such as the J/W. 


7.4.2  Decays of 7—leptons 


In the Standard Model the decays of the 7 lepton are solely mediated by the weak 
interaction, giving rise to a T-neutrino, vr in the final state and either a charged lepton 
and its anti-neutrino or hadrons. 


7.4.2.1 7 decays into leptons 


For the former, the fully leptonic decay, matrix elements at leading order can be written 
as the product of two left-handed currents, namely 


M00), = T (u AP un, ) (a. Yul us) = Edt Ly (7.79) 
This is the expression from Fermi’s theory of weak interactions, which rests on 
four-fermion interactions of the type above. It is an effective theory which emerges 
from the full SM by integrating out the W boson as heavy degree of freedom. This is 
well justified in the decay of light objects; merely analysing the W propagator and its 
couplings to the fermions, 


m e2 gi” p? Gr p? ) 
wo — +4 oO | == = — gt” + O| — 5 7.80 
2- mi, 8m? sin? Oy (2) V2 2 (4 (eap) 


Integrated over the phase space of the three outgoing particles, Eq. (7.79) yields the 
well-known partial decay width for weak decays of leptons, namely 


G? mË 
19273 


TD sy, bi, = 


2 

f (=) with f(x) = 1-— 8r + 8r? — rt — 1227 logx, (7.81) 
m7 

which gives rise of a branching ration of about 17 % per lepton. 


7.4.2.2 r decays into hadrons 

Similar reasoning can also be applied to the decay to hadrons. There, however, the 
leptonic current J” must be replaced by some hadronic current for the production of 
N hadrons hj, hog, .... Such hadronic currents can be written as 


eee = Vug (hae hy Ua Ve Ug 


0) l (7.82) 
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taking into account the fact that for simple reasons of energy and charge conservation, 
the only allowed combinations of quarks are ud and us, reflected also in the CKM—- 
matrix element in front of the current. 

In the simplest case, when only one pseudo-scalar hadron PS is produced, like a 
a or a K`, the current reduces to 


ete Ving (Ps Ua YF uq 


0) = = Wag Foss (7.83) 


with pps the four-momentum of the hadron and its decay constant fps, 
fr = 130.2(1.7)MeV and fx = 155.6(0.4) MeV. (7.84) 


This yields the partial width 


G2|Vugl? fe. m3 meai” 
Trv PS- = al il Tes (1 rts) ’ (7.85) 


or branching ratios of about 11% and 0.7% for the decay into a single charged pion 
or kaon. 

In principle, currents similar to the one in Eq. (7.83) could also be written for the 
production of vector or tensor particles, but in reality these particles typically decay 
fairly quickly and it is thus more useful to model currents with more than one hadron 
in the final state. Such more complicated final states in 7 decays consist of a number 
of pions, kaons and 7-mesons, and they typically exhibit relatively rich structures 
resulting from a variety of different resonances such as p’s, K*’s, a,’s, etc.. As an 
example, consider the next complicated case of the production of a charged hadron 
h-, such as at or a K~, accompanied by a neutral hadron h° like a 7°, K®, or 7. 


Then, 
») 


vy ad -p° =h? 
= V2V qq IG -n ) (on — ro.» FIP (a?) 4 git FEH (A)| , 
(7.86) 


Jt o = Vag (aor 


=z ab 
Ua Yr uq 


with q = pn- + ppo and where two form factors, Fy and Fs have been introduced. A 
typical ansatz to parameterize the form factors has been provided in [703], giving rise 
to the phenomenological Kihn—Santamaria model. It has been further refined for 
instance in [529, 706] and forms the basis for the successful TAUOLA package [646] and 
other implementations of r-decays in HERWIG++ [597] and SHERPA. In this ansatz, the 
form factors are composed of a number of Breit-Wigner resonances of the form such 
that, for instance, 


eG aS fy (7.87) 
K: T X av = mi, —s—imyTy(s)’ j 
V 
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where the factor ay define the relative weight of the contribution, the sum over V 
includes V € {p, p’, p”} and Ty (s) is the scale-dependent total width of the resonance. 
These terms, and in particular the relative weight a are typically fitted to experimen- 
tal data; in order to further improve the agreement, sometimes additional resonances 
are introduced in this kind of phenomenological model, which do not necessarily cor- 
respond to a known physical state. 

Alternatively, for the construction of these currents or the relevant form factors, 
low-energy effective theories of QCD could be invoked. The most appropriate candi- 
date appears to be chiral perturbation theory (PT); for a comprehensive review 
see [554]. The idea underlying this effective theory is to understand chiral symmetry 
as one of the fundamental symmetries of the SU(3)r ® SU(3)r Lagrangian with the 
quark mass terms breaking it [433-435, 553, 794]. This allows for an expansion in 
ratios of quark masses over typical momentum transfers in mesonic processes. This 
idea has been extended to also include resonances such as the vector mesons named 
above and their coupling to the pseudo-scalars in resonance chiral perturbation 
theory [494, 495], RyPT. 


7.4.2.3 Weak decays of heavy hadrons 


There is some similarity when leaving the decays of the 7-leptons and moving on to 
deal with weak decays of heavy hadrons containing c or b quarks. There are typically 
three decay modes associated with them that are phenomenologically relevant, namely 
purely leptonic decays of mesons of the form M — De, semi-leptonic decays of the 
form H —> De + X, where X denotes one or more hadrons, and fully hadronic decays 
H > hiha... hy. 


7.4.2.4 Fully leptonic decays 

At leading order, purely leptonic decays of charged mesons of the type MT —> Cig 
can be described by matrix elements based on a current—current interaction similar to 
the one encountered already in Eq. (7.79), 


Gr e 
Mmeo = z Var (o Ug Yul Ug’ 


V2 
where q and q’ are the constituents of the meson M. For pseudo-scalar mesons PS, 
which as constituents of the lowest lying multiplet are the ones that typically decay 
through the weak interaction, again the hadronic part of the matrix element can be 


replaced with Eq. (7.83) involving the decay constants for the heavy mesons, summa- 
rized in [799] as 


m) (iert uw) , (7.88) 


fp+ ~ 211.9(1.1) MeV, fp, ~ 249.0(1.2) MeV, 
fet ~ 187.1(4.2)MeV , fgo œ~ 190.9(4.1)MeV, and fp, ~ 227.2(3.4) MeV. 
(7.89) 
The corresponding branching ratios are given by 
TPs = Gh fpsmesmi Vag | (1 ue i‘ (7.90) 
ST mps 
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Of course such decays of the light pseudo-scalars 7* and K~ typically do not play a 
role for LHC physics, since the weak interaction leads to a long lifetime of these objects 
which in turn usually reach the detector. However, the very same decay modes of heavy 
mesons are relevant for a number of reasons. First of all, on their own they provide an 
interesting laboratory testing the interface of perturbative QCD and derived effective 
theories with lattice QCD and data and thus allow for the measurement of important 
quantities necessary for the description of the phenomenologically relevant rare decays. 
In addition, due to the presence of heavy quarks the weak decays would be potentially 
susceptible to interactions that are sensitive to the quark mass. They thereby allow 
for a relatively clean way to study, for instance, the effects of interactions mediated by 
charged Higgs bosons. The case of a B meson decay illustrates this, where the inclusion 
of charged Higgs bosons would modify the above partial width by a factor [635] 


2 2 
r= (1 5 tan? g ES ) l (7.91) 


HE 


7.4.2.5 Semi-leptonic decays 


The analogy with the treatment through current-current interactions similar to the 
decays of r decays is also manifest for semi-leptonic decays of the type H —> 4De + X. 
These decays are often used to tag heavy flavours produced at relevant hard processes 
at larger scales, and they also allow for the determination of form factors relevant 
for rare decays, see below. Depending on the hadronic final state X, different con- 
siderations for their actual evaluation will apply. Consider first the case of exclusive 
semi-leptonic weak decays, where X is a specific hadron, X = h, like for instance 
in the B-meson decay B —> pD. In such cases the matrix element reads 


Munn = E Var (n H) (« 7 un) = SERI, (192) 
and, once again, one is left with the determination of the hadronic current. Similar 
to the case of fully leptonic decays, though, also semi-leptonic decays of most light 
hadrons are not relevant for LHC phenomenology. This is due to the relatively long 
lifetime of the involved hadrons, which, again, would consequently arrive at the detec- 
tor to decay in it. For the few relevant decays of, e.g., Ks, XPT and other effective 
theories or sum rules help to constrain or evaluate the hadronic decay currents. 


Ug YuL Ug! 


7.4.2.6 Heavy quark symmetry 


In contrast, in the case of transitions from one heavy quark to another, like in the 
previous example, an interesting symmetry of QCD comes to help: the heavy meson 
in question can be seen as the quark surrounded by some light “brown muck” dragged 
along with it. Since the weak decays happen on time-scales shorter than the scales 
dominating the bound state, and as long as it continues to be dragged along at about 
the same velocity, this muck will, at leading order, not resolve the decay or the differ- 
ence between heavy quarks of different flavour and with different mass. This idea can 
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be formalized as heavy quark symmetry [784]. There it gives rise to an expansion in 
Aacp/ma, aptly known as heavy mass expansion, with mg the mass of the heavy 
quark or meson. Following these ideas it is possible to formulate an effective theory, 
heavy quark effective theory (HQET) [497, 567, 736], which is particularly pow- 
erful for analysing form factors for hadronic currents like the heavy-to-heavy current 
in Eq. (7.92). Following ideas introduced in [847, 848], for pseudo-scalar mesons h and 


H, 
(n 


where €(v;, - vq) denotes the Isgur—Wise function [640, 642], which depends on the 
velocities vp and vq of the two mesons h and H. €(vv’) is normalized such that 


Ug Yu Ug’ 


H) = E(Un UH) (un + UH) p (7.93) 


E) = 1+ O(1/md) . (7.94) 


Similar equations also hold true for cases with a more complicated spin structure, 
cf. [784], where due to the heavy quark symmetry the form factors enjoy similar prop- 
erties and some remarkable relations among each other. 

It should be noted here that such an analysis can also be extended to decays of 
heavy baryons [568, 569, 643, 745] and to heavy-to-light transitions, to cases where 
h is a light meson, see [293, 519, 641] for some early work. In contrast, the hadronic 
state X in semi-leptonic decays can also be composed of a variety of hadrons, leading 
to inclusive semi-leptonic decays when all configurations are summed over. From 
a simple physical point of view such configurations stem from the fragmentation of the 
outgoing quark produced in the weak decay and the spectator quark or diquark which 
together with the decaying quark have formed the hadron. In this simplistic picture, 
the fragmentation does not change the partial widths on the parton level and the 
inclusive partial widths are thus given by the quark-level expression only, the quark 
current is not replaced by hadronic matrix elements. In principle this would allow for 
a fairly straightforward determination of relevant quantities describing the decay, like 
for instance the CKM element Vj, present in the transition amplitude of Eq. (7.92). 
In view of the larger number of form factors in heavy-to-light transitions this was 
thought to be particularly relevant for the case of Va» [639, 719]. The experimental 
difficulty in such a measurement, however, is to collect all relevant final-state particles 
and to cover all possible phase space for them. This poses a challenging problem at 
hadron colliders such as the LHC. 


6 As a first result of this symmetry consider for example the mass splitting between pseudo-scalar 
and vector mesons with the same flavour quantum numbers such as B* and B: 


mp» -mp x l/mp, 
and as a result the quadratic mass differences are nearly constant 


2 De ANE? PETNE? S 2 
mp -mp X Mpx -mp ¥ mp: — mp, ~ 0.5 GeV? . 
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7.4.2.7 Non-leptonic decays 


In principle, similar operators also account for the weak non-leptonic decays of hadrons 
on the parton level: looking at the current—current interaction in Eq. (7.79) one would 
only have to replace the -lepton with the corresponding decaying quark and the 
outgoing leptons and neutrinos with the quarks that are produced in the decay. How- 
ever, now the hadronic final state originates from the fragmentation of three outgoing 
quarks from the Fermi interaction plus the spectator. This renders the calculation of 
partial widths related to such decays a hard nut to crack theoretically. There are some 
simplifications, most notably in cases where the decaying quark is a heavy quark (c or, 
even better, a b quark). Some progress is possible by again using technologies deeply 
rooted in the fact that the heavy quark mass sets a scale mg allowing for an expansion 
in Agcp/mg [247, 248, 261, 388]. In the spirit of the treatment of exclusive processes 
with large momentum transfers in QCD through light-cone methods [726] and first 
worked out for the case of B — am decays, the current-current interaction can be 
decomposed — factorized — such that 


B) (x 0) i +X rna +O (=) (7.95) 


with perturbatively calculable coefficients r,, [212]. The picture underlying this factor- 
ization is as follows: due to the large mass difference between the incident B meson 
and the pions the qq pair forming one of them will be fairly collimated. As it is a 
colour singlet, soft gluons with momentum of the order of Agcp will not see it and 
thus decouple at leading order in Agcp/mg. Similar reasoning also holds true for 
the spectator quark, which in the B rest frame typically also carries momentum of 
order Aqcp. If it does not take part in any way in the hard interaction, it merely con- 
tributes the quantum number to the pion formed on the b-quark side of the current. 
This is then absorbed into a corresponding B — m form factor already encountered 
in the treatment of semi-leptonic decays. Doe to the size of this colour singlet and its 
composition, therefore the two pions at leading order factorize nicely. This reasoning 
could be extended to some degree to other final states, and in particular also to those 
containing a heavy meson instead of the pion [213]. 


Ju,0q 
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7.4.2.8 Rare decays 


A specific class of weak decays is represented by rare decays which, in contrast to the 
decays discussed up to now, cannot directly be identified with the tree-level exchange 
of a W-boson. In contrast to the this case, in which in the low-energy limit therefore 
becomes equivalent to the Fermi four-fermion interaction, the rare decays typically are 
mainly driven by loop-induced interactions. Fig. 7.19 shows some examples, where W 
bosons and additional quarks enter as virtual particles. These decays on the quark level 
quite often have the form of flavour-changing neutral current (FCNC) processes, 
where the overall charge transmitted from the heavy quark line to other final-state 
objects is zero, but the flavour of this heavy quark line changes. They may manifest 
themselves as purely leptonic decays, like for example Bs > u7 u™, the most prominent 
one at the LHC. In addition, there are also semi-leptonic decays, related to, e.g., b > sy, 
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Fig. 7.19 Some example Feynman diagrams for rare decays of heavy 
quarks: b — sy (left) and b — sé@ (centre and right). The graph in the 
middle motivated the slightly unintuitive name of “penguin-diagrams”. 


b — sll, b > ss3 etc.. Quite often these processes are also known as “penguin” 
decays. The most prominent one at the LHC to date are the decays B > K* u` u* and 
B, > dp u”, triggered by the quark-level b > sé@ transition. In addition, although 
not a decay, also mixing in systems of neutral mesons such as B°B°-mixing, falls 
into the category of such processes. 

In FCNC processes, the change of flavour quantum numbers along the quark lines 
typically stems from a loop consisting of W bosons and a combination of quarks — 
u, c, and t in the examples depicted in Fig. 7.19. By invoking the unitarity of the 
CKM matrix and by assuming massless charm quarks, the contribution of the c 
and u quarks can be summed and combined with the t contribution using the same 
combination of CKM elements V;,V,*.” This is the celebrated GIM mechanism [581]. 
As a consequence, rare processes such as b — sy are driven by the large mass hierarchy 
between the up-type quarks; conversely, similar processes such as c > uy are GIM- 
suppressed. 

The interest in rare processes is fed by the observation that typically the particles 
running in the loops are fairly heavy — W bosons and t quarks — and that similar 
loops could also originate from hitherto unknown heavy particles from new physics 
scenarios. Therefore, such rare processes allow indirect probes of physics beyond the 
SM. In order to systematically study such processes, operator product expansion 


TTo see how this works assume that the loop contributions to a process such as b + sy can be 
written as a linear combination of functions that depend on the ratio of quark and W-mass multiplied 
with the corresponding CKM elements. Then 
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where the unitarity of the CKM matrix has been used in the form 


Vot Vis , 
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(OPE) is frequently used [893], which rests on a factorization of short- and long- 
distance contributions to a given process. This results in operators describing the latter, 
multiplied by Wilson coefficients taking account of the former. The result of the 
loop and possibly higher-order corrections drives the coefficients, which are therefore 
the relevant quantities sensitive to potential new physics effects. The construction of 
this operator basis has been intensively discussed in the literature, for a comprehensive 
review cf. [289]. 


7.4.2.9 Aside: CP violation 


At this point it is interesting to point out that the complex phase in the CKM 
matrix induces CP violation in the SM. This manifests itself as differences be- 
tween the branching ratios or kinematic distributions of decays I — F of an initial 
state J into a final state F and the corresponding properties of the conjugate decay 
I + F. As direct CP violation this phenomenon may occur in decays of charged 
and neutral particles; it is triggered by having different amplitudes — quite often 
tree-level amplitudes and loop-induced ones — contributing to the same decay. Then 
different combinations of CKM elements may be invoked, some of which are purely 
real and some others exhibiting the weak phase. Since the weak phase flips sign when 
anti-particles are involved, different contributions may exhibit different interference 
patterns, thus yielding different total amplitudes and therefore different decay charac- 
teristics. Additionally, in neutral mesons, the phenomenon of M M-mixing introduces 
another source of CP violation. The textbook example is CP-violation in kaon mixing, 
parameterized by eg, which manifests itself by a small branching probability of Kg 
(Kz) mesons to decay into three (two) pions. Similar mixing also occurs in the neu- 
tral D°D®° and Ba,,Ba,, meson systems. Finally of course, CP violation can also be 
triggered through the interference of decay and mixing amplitudes. For a pedagogical 
introduction into this subject, cf. for instance the BABAR physics book [616]. 


7.4.3 Strong decays 


In addition to hadron decays mediated by the weak interaction there are also decays 
that proceed through the strong interaction, such as n > ama, ¢ > KK, or the de- 
cay of quarkonia like J/W to lighter hadrons. They all have in common that flavour 
quantum numbers are conserved, in contrast to the weak decays discussed up to now. 
Consequently, on the parton level, such decays could possibly be seen as the anni- 
hilation of the quark—anti-quark pair constituting the decaying hadron, mainly into 
gluons, which in turn would fragment into the observed final-state hadrons. For such 
decays of the lighter hadrons in particular, effective theories like chiral perturbation 
theory could be invoked again. 

Alternatively, for quarkonia — bound states of two heavy quarks — using per- 
turbative QCD is a viable option as well. Depending on the spin of the quarkonium 
in question, different decay routes would apply. For (pseudo-)scalar particles such as 
the ne decays typically would yield two gluons or photons in the final state while for 
(axial-)vector particles there are more parton level channels available. For instance, for 
the case of J/W this would include the annihilation of the cc-pair forming the J/V into 
a virtual photon, which subsequently would decay into a pair of leptons or quarks or 
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Fig. 7.20 Parton-level decay channels of J/W: annihilation into fermion 
pairs, annihilation into three gluons, two gluons and a photon, and into 
three photons (from left to right). 


the annihilation into three gluons, two gluons and a photon, or three photons, see also 
Fig. 7.20. By far and large, the rates calculated for leptonic, photonic and hadronic 
decays follow the pattern suggested by these parton—level considerations. A simple 
back-of-the-envelope estimate suggests that the branching ratio to a single lepton is 
given by 

e2a 7 ea 
Ca3(m,) + e2 a 2 ef 5a3(m_) + 4ae? 


BR yaaa x 5%, (7.96) 


where ee = 2/3 is the charge of the charm quark, and the factor C = 5 stems from 
the colour factor related to the transition of a singlet to three gluons including their 
symmetrization. The sum over accessible fermions includes e, u, u, d, and s for the 
case of charmonia, yielding a factor of four. This crude estimate of a branching ratio 
of 5% has to be compared with the measured branching ratio of about BRjjye = 
5.94% [799] for the decay of a J/W into one lepton flavour. It has to be noted, though, 
that this result of course depends crucially on the choice of scale in ag. 

Decays of this kind are important for two reasons. First of all, they provide a test of 
perturbative QCD at relatively low scales, which is interesting in its own right. Further- 
more, and probably far more importantly, at hadron colliders such as the TEVATRON or 
the LHC, the lepton pairs produced in decays of J/V, Y’, or T mesons, at well-defined 
and relatively sharp invariant masses of about 3.097 GeV, 3.686 GeV, and 9.46 GeV, 
respectively, allow, together with leptons from the Z boson, a calibration of the muon 
chambers and electromagnetic calorimeters. They therefore directly contribute to the 
overall success of every measurement with leptons in the final state. 


8 
Data at the TEVATRON 


In this chapter, many of the ideas developed in the book up to now will be put 
to the test, focusing on the comparison of QCD theory predictions from fixed—order 
calculations, analytic resummation calculations, and full hadron—level simulations with 
data from the TEVATRON era. Not only did the experiments at the TEVATRON test the 
theory of QCD over a wide range of scales, but in addition they also probed interesting 
non-perturbative aspects such as soft QCD interactions and multiple parton—parton 
scattering. While such an environment is very close to the conditions encountered at 
the LHC, the experimental data discussed here have been taken at much lower energies 
and in a comparably-reduced phase space. 

Nevertheless, the goal of this chapter is to appreciate the precision of theory and 
experiment achieved at the TEVATRON, that was instrumental in inspiring a profound 
faith in the technology that is now employed at the LHC. We will review some of 
the salient analyses at the TEVATRON, dealing with many of the same processes that 
will also be discussed for the LHC. The aim is not to present a complete review of 
TEVATRON physics, including all of the most up-to-date results, but to discuss those 
physics topics which will serve as a good pedagogical introduction to QCD physics at 
the LHC. This will help to set the scene for Chapter 9, where results from Run I of 
LHC will be confronted with theoretical predictions in the same spirit. 


8.1 Minimum bias and underlying event physics 
8.1.1 Minimum bias events 


The overwhelming majority of the events produced at the TEVATRON are due to pe- 
ripheral collisions of the proton and anti-proton, the so-called minimum bias events. 
These events occur at such a high rate (50 mb) that it would be impossible to record 
all of them. However, through the use of a highly pre-scaled trigger, a small fraction 
of these events are saved. The events contain interesting physics in their own right, as 
well as providing a means of monitoring the performance of the detector. In the high 
luminosity running typical of Run II at the TEVATRON, there are many such minimum 
bias events in every beam crossing, and the effects of such pileup must be dealt with 
for the precision determination of physics at higher scales. 


The Black Book of Quantum Chromodynamics: A Primer for the LHC Era. John Campbell, 
Joey Huston, and Frank Krauss. © John Campbell, Joey Huston, and Frank Krauss 2017. 
Published in 2017 by Oxford University Press. 
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Fig. 8.1 The track pr differential cross-section in the central rapidity 
region for CDF in Run I and Run II. Reprinted with permission from 
Refs. [74, 75]. 


The transverse momentum distribution for charged tracks with pseudo-rapidity less 
than 1 is shown in Fig. 8.1 for CDF in Run II, compared to the similar distribution 
in Run I [74, 75]. As expected, the majority of tracks are at very low transverse 
momentum. On average, there are slightly more than 2 tracks per unit pseudo-rapidity 
for events in which at least one track has a transverse momentum larger than 0.5 GeV. 
Tracks in the higher pr range are constituents of jets; as can be observed in the figure 
there is a larger cross-section for the production of these tracks at 1.96 TeV than at 
1.8 TeV. Fig. 8.2 shows that the track—pr distribution is reasonably well-described by 
PYTHIA using Tune A.! 

Late in its career, the TEVATRON carried out an energy scan, running at energies 
of 300 and 900 GeV, in addition to its normal Run II energy of 1.96 TeV. The data 
obtained from the relatively short runs proved to be very useful in tuning models for 
minimum bias production. The average particle density (dn/dn) for charged particles 


Tune A refers to the values of the parameters describing multiple-parton interactions and initial 
state radiation which have been adjusted to reproduce the energy observed in the region transverse 
to the jet. 
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Fig. 8.2 The track pr differential cross-section in the central rapidity 
region for CDF in Run II compared to the prediction from PYTHIA Tune 
A. Reprinted with permission from Refs. [74, 75]. 


with |7| < 0.8 and pr > 0.5 GeV is shown in Fig. 8.3 [72]. An extrapolation predicts 
a charged particle density of 3.1 at 7 TeV and 3.3 at 8 TeV. 


8.1.2 Underlying event 


Any event at a hadron-hadron collider consists of a hard collision of two incoming 
partons, with possible QCD radiation from both the incoming and outgoing legs, 
along with softer interactions from the remaining partons in the colliding hadrons. 
Such interactions represent the underlying event energy discussed in Chapter 2. A 
schematic depiction of this situation is shown in Fig. 8.4 for the case of a dijet event. 

The underlying event energy is due to the interactions of the spectator partons 
in the colliding hadrons. It results in an energy deposit of approximately 0.5 GeV 
for a cone of radius 0.7 and is similar to the amount of energy observed in minimum 
bias events with a high track multiplicity. The rule-of-thumb has always been that the 
underlying event energy in a jet event looks very much like that observed in minimum 
bias events, i.e. that there is a rough factorization of the event into a hard scattering 
part and a soft physics part [121]. 

Studies have been carried out with inclusive jet production in CDF, examining the 
transverse momentum carried by charged particles inside and outside of jets [121, 127]. 
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Fig. 8.3 The average track multiplicity distribution for CDF in the central 
rapidity region (|7| < 0.8), for tracks with pr > 0.5 GeV, as a function of 
the centre-of-mass energy. Reprinted with permission from Ref. [72]. 
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Fig. 8.4 Schematic cartoon of a 2 — 2 hard scattering event. 


For example, the geometry for one study is shown in Fig. 8.5, where the towards and 
away regions have been defined with respect to the direction of the leading jet. 

Of the two transverse regions indicated in Fig. 8.5, the one with the largest trans- 
verse momentum is designated the TransMAX region and the one with the lowest, the 
TransMIN region.” The transverse momenta in these two regions is shown in Fig. 8.6. 
As the lead jet transverse momentum increases, the momentum in the TransMAX 
region increases; the momentum in the TransMIN region does not. The amount of 
transverse momentum in the TransMIN region is consistent with that observed in 
high multiplicity minimum bias events at the TEVATRON. At the parton level, the 
TransMAX region can receive contributions from the extra parton present in NLO 


2A similar analysis was carried out in Run 1, using cones of radius 0.7, at the same 7 as the lead 
jet in the event and +90° in ¢, to define the MIN and MAX regions [127]. 
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Fig. 8.5 Definition of the toward, away and transverse regions. 
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Fig. 8.6 The sum of the transverse momenta of charged particles inside 
the TransMAX and TransMIN regions, as a function of the transverse 
momentum of the leading jet. from Ref. [133]. The solid curves are the 
predictions from PYTHIA and the dashed curves are the predictions from 
HERWIG. Reprinted with permission from Ref. [133]. 


inclusive jet calculations. The TransMIN region can not. There is good agreement be- 
tween the TEVATRON data and the PYTHIA tunes, not surprising since the data was 
used in the creation of the tunes. 
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8.2 Drell-Yan production 


As discussed in Chapter 2, W/Z production at hadron-hadron colliders serves as a pre- 
cision benchmark for Standard Model physics. This is true especially for the TEVATRON 
where, on the experimental side, the systematic errors are small. The decay leptons 
are easy to trigger on and the backgrounds are under good control. An electron cluster 
is formed from all towers in the electromagnetic calorimeter containing energy from 
the electron shower. The inclusion of such towers? in the electron energy will also 
effectively add to the electron 4-vector the energy from any collinear photons radiated 
by the electron. Thus, in the language of Section 2.1.6, these would be referred to as 
dressed leptons. This is not true for muon candidates, where no calorimeter energy 
is added to the muon 4-vector. Thus, these would be referred to as bare leptons. 

Identification cuts, using both tracking and calorimetric information, are applied to 
the lepton candidates to improve their purity. In addition, for electrons, an isolation 
cut is applied which requires that the energy around the electron candidate in a cone 
of radius R = 0.4 be less than a fixed amount (typically 4 GeV). As an alternative, 
it is sometimes required that the isolation energy is less than a fixed fraction of the 
electron’s transverse momentum. The isolation cut serves to reduce the rate for jets 
faking electrons. The isolation energy sum excludes the towers already included in the 
electron cluster. No such isolation cut is applied for muons. However, the energy de- 
posited by the muon in the calorimeter is required to be consistent with that expected 
from a minimum ionizing particle. 

For some of the theoretical predictions used for W/Z production at the TEVATRON 
PHOTOS [591] or other QED FSR simulations can be used to generate QED radia- 
tion for the leptons. This is true for PYTHIA as well as for RESBOS [300], a generator 
that accounts for the effects of resummation at small W/Z transverse momenta (as 
described in Section 5.3). Alternatively, the W and Z boson cross-sections and differ- 
ential distributions can be compared to the known NNLO QCD predictions, such as 
those provided by FEWZ2 [556]. 

The W and Z cross-sections measured at the TEVATRON are shown in Fig. 8.7 
for both Run I and Run II [122]. The experimental cross-sections agree well with the 
theory predictions at NNLO and the Run II cross-sections show the rise expected from 
the increase in centre-of-mass energy over Run I. 

The Z rapidity distribution measured by DØ in Run II is shown in Fig. 8.8 [94], with 
the measurement agreeing with the NNLO prediction over the entire rapidity range. 
This agreement is non-trivial since there is a shape change in the rapidity distribution 
between LO to NLO results, which is primarily driven by the differences between 
LO and NLO PDFs discussed in Chapter 6, as well as an increase in normalization. 
On the other hand, the transition from NLO to NNLO is essentially just a small K- 
factor with little change in shape (cf. Fig. 3.10). It is noteworthy that the largest 
experimental uncertainty for the W and Z cross-sections is the uncertainty in the 
luminosity, typically on the order of 5% at the TEVATRON (it is lower at the LHC). 


3A tower refers to the lateral segment of a calorimeter reading out a specific region in An and Ad. 
There may be more than one longitudinal division of the calorimeter within the tower, in which case 
the energies of all of the longitudinal divisions are added together to form the tower energy. 
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Fig. 8.7 W and Z cross-sections as a function of the centre-of-mass energy 
at the TEVATRON, from Ref. [122]. Copyright IOP Publishing. Reproduced 
with permission. All rights reserved. 


The theoretical systematic errors are primarily from the PDF uncertainty. Such cross- 
sections have thus proven to be useful as inputs to global PDF fits. 

The transverse momentum distribution for Z bosons, measured at CDF in Run II 
of the TEVATRON [65], is shown in Fig. 8.9 along with comparisons to predictions from 
RESBOS and FEWZ2. The FEWZ2 and RESBOS predictions agree well with the data 
at high transverse momenta. The inset shows the low transverse momentum region, 
where the RESBOS prediction also matches the data, including the turn-over of the 
cross-section for pr < 5 GeV, as expected from a resummation calculation. This very 
low pr region is sensitive to non-perturbative effects. FEWZ2 is not shown for the low- 
pr region as, being a fixed-order calculation, it will not provide sensible predictions. 
It is worth noting that the CDF pr data have been plotted with a very fine binning, 
made possible by the excellent tracking resolution of the CDF detector. Such binning 
allows a better determination of the low pr physics. 

For Drell-Yan production, the average transverse momentum of the lepton pair 
has been measured as a function of their invariant mass. Fig. 8.10 [65] shows the 
measurement made by CDF in Run II. The average transverse momentum increases 
roughly logarithmically with the square of the Drell-Yan mass, as expected from the 
discussion in Section 2.3.2. The data agree well with the default PYTHIA 6.2 prediction 
using Tune A. Also shown are two predictions involving tunes of PYTHIA that give 
larger or smaller values for the average Drell-Yan transverse momentum as a function 
of the Drell-Yan mass. The Plus/Minus tunes were used to estimate the initial- 
state-radiation uncertainty for the determination of the top mass in CDF. Most of 
the tt cross-section at the TEVATRON arises from qq initial states, so the Drell-Yan 
measurements serve as a good model. This will not be true at the LHC, where the 
dominant initial state for tt production is gg. 
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Fig. 8.8 The Z rapidity distribution from DØ in Run II. Reproduced with 
permission from Ref. [94]. 


8.3 Inclusive jet production 


Inclusive jet production at the TEVATRON plays an important role in this book for a 
number of reasons. First, it probes the highest transverse momentum range accessible 
at the TEVATRON. Second, it has a large impact on global PDF analyses. Finally, many 
of the subtleties regarding measurements with jets in the final state and the use of jet 
algorithms come into play, providing an introduction to some of the issues which will 
also be encountered at the LHC. 

In many events, the assignment of individual calorimeter towers into jets is fairly 
unambiguous and the jet structure in such final states is relatively clear. However, in 
some events, the complexity of the energy depositions means that different algorithms 
will result in different assignments of towers to the various jets. This is no problem to 
the extent that a similar complexity can be matched by the theoretical calculation to 
which it is being compared. This is the case, for example, for events simulated with 
parton shower Monte Carlos. However, a NLO calculation for inclusive jet production 
(or, indeed, any other process) can place at most two partons in a single jet. 

Before proceeding with a discussion of the TEVATRON jet analyses, it is worthwhile 
to step back and consider a few practical experimental implications. At the TEVATRON, 
IR-unsafe algorithms (cf. Section 2.1.6) were universally used. These included JetClu in 
CDF [108] and a comparable cone jet algorithm in D@ in Run I, and then the Midpoint 
cone algorithm in Run H (although JetClu continued to be used in CDF for many 
analyses). These algorithms required the presence of seeds in order for the jet clustering 
to begin, introducing the IR-safety problem that was discussed in Section 2.1.6. For 
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Fig. 8.9 The transverse momentum distribution for Z — e 
CDF in Run II (black crosses), along with comparisons to predictions 
from FEW2Z2 (dot-dash histogram) and RESBOs (solid histogram). The in- 
set shows a closeup of the low transverse momentum region. Reproduced 


with permission from Ref. [65]. 


Run II, the Midpoint jet algorithm was developed, which pushed the IR-safety problem 
to a higher order (NNLO).* The SISCone algorithm, which is IR-safe to all orders, 
was developed late in Run I, too late for any analyses,” but Monte Carlo comparisons 
using this algorithm and the Midpoint algorithm were carried out for many analyses. A 
few analyses were carried out with the ([R-safe) kr algorithm. The anti-kr clustering 
algorithm was developed too late for any TEVATRON analyses. 

The IR-safety problem applies only to fixed-order calculations, i.e. any of the jet 
algorithms mentioned above are IR-safe when applied to data/Monte Carlo, simply, 
because for hadrons there is no infrared problem with the minimal hadron mass being 
Mr © 135 MeV. The differences between the Midpoint and SISCone algorithm are 
finite and typically of the order of a few percent at most, so it is perfectly acceptable 
to use a SISCone jet algorithm in a fixed-order prediction to compare to data taken 
with the Midpoint algorithm. The stochastic instabilities introduced in Monte Carlo 
simulations, and inherent in the data itself, tend to level the playing field for many of 
the jet algorithms, since the extra effective seed in the Midpoint algorithm and the 
extra seeds in the SISCone algorithm have little impact [508]. 


4The Midpoint algorithm also introduced the use of the kinematic variables pp and y, in contrast 
to the variables Er and 7 used for the earlier algorithms. 


5Tyadition and inertia can make it difficult for experiments to give up old jet clustering algorithms, 


even if better algorithms are available. Thus, it is very good that both ATLAS and CMS have used the 
anti-k algorithm from the start. 
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Fig. 8.10 The average transverse momentum for Drell-Yan pairs from 
CDF in Run II, along with comparisons to predictions from PYTHIA. Re- 
produced with permission from Ref. [65]. 


8.3.1 Corrections 


For comparison of data to theory, the calorimeter tower energies clustered into a jet 
must first be corrected for the detector response. The calorimeters in the CDF ex- 
periment (or basically any experiment) respond differently to electromagnetic showers 
than to hadronic showers, and the difference varies as a function of the transverse 
momentum of the jet. The detector response corrections are determined using a de- 
tector simulation in which the parameters have been tuned to test-beam and in-situ 
calorimeter data. PYTHIA, with Tune A, is used for the production and fragmentation 
of jets. The same clustering procedure is applied to the final state particles in PYTHIA 
as is done for the data. The correction is determined by matching the calorimeter jet 
to the corresponding particle jet. An additional correction accounts for the smearing 
effects due to the finite energy resolution of the calorimeter. At this point, the jet is 
said to be determined at the hadron level. 

One of the observables that is crucial to be able to described is the jet shape. 
A study of this quantity is shown in Fig. 8.11 [122], where the jet energy away from 
the core of the jet (i.e. in the annulus 0.3 < R < 0.7) is plotted as a function of the 
transverse momentum of the jet. The general feature of these curves, that jets become 
more collimated as the jet transverse momentum increases, can be understood as due 
to three effects: First, power corrections that tend to broaden the jet decrease as 1/pr 
or 1/p%; second, a larger fraction of jets are quark jets rather than gluon jets; third, 
the probability of a hard gluon to be radiated (the dominant factor in the jet shape) 
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Fig. 8.11 The fraction of the transverse momentum in a cone jet of radius 
0.7 that lies in the annulus from 0.3 to 0.7, as a function of the transverse 
momentum of the jet. Comparisons are made to several tunes of PYTHIA 
(left) and to the separate predictions for quark and gluon jets (right). 
Reprinted with permission from Ref. [122]. 


decreases as ag(p7.). As can be seen in Fig. 8.11, the PYTHIA predictions using Tune 
A describe the data well, even better than with the default PYTHIA prediction. In fact 
a reasonable description of the jet shape can also be provided by the pure parton- 
level NLO prediction [510], perhaps supplemented by non-perturbative corrections, as 
discussed in Section 4.1, and in Ref. [680]. 

For data to be compared to a parton level calculation, the theory must be corrected 
to the hadron level.® In general, the data should be presented at the hadron level, and 
the corrections between hadron and parton level should be clearly stated. In retrospect. 
this seems obvious, but the TEVATRON jet measurements were one of the first analyses 
where this was true. 

The hadronization corrections consist of two components: the subtraction from 
the jet of the underlying event energy discussed in Section 8.1 and the correction for 
a loss of energy outside a jet due to the fragmentation process. The hadronization 
corrections can be calculated by comparing the results obtained from PYTHIA at the 
hadron level to the results from PYTHIA when the underlying event and the parton 
fragmentation into hadrons has been turned off. The underlying event energy is due 
to the interactions of the spectator partons in the colliding hadrons and the size of the 
correction depends on the size of the jet cone. As discussed earlier in this chapter, the 
rule-of-thumb has always been that the underlying event energy in a jet event looks 
very much like that observed in minimum bias events. 


SIn some analyses at the TEVATRON, the data was corrected to the parton level using the inverse 
of the parton to data corrections. 
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The fragmentation correction accounts for the daughter hadrons ending up outside 
the jet cone from mother partons whose trajectories lie inside the cone (also known as 
splash-out); it does not correct for any out-of-cone energy arising from perturbative 
effects as these should be correctly accounted for in a NLO calculation. It is purely a 
power correction to the cross section. The numerical value of the splash-out energy is 
roughly constant at 1 GeV for a cone of radius 0.7, independent of the jet transverse 
momentum. This constancy may seem surprising. But, as just discussed, as the jet 
transverse momentum increases the jet becomes more collimated. The result is that 
the energy in the outermost annulus, the region responsible for the splash-out energy, 
is roughly constant. The correction for splash-out derived using parton shower Monte 
Carlos can be applied to a NLO parton level calculation to the extent to which both 
the parton shower and the two partons in a NLO jet correctly describe the jet shape. 

The two effects of underlying event and splash-out produce corrections which go in 
opposite directions. Therefore they partially cancel when computing the total correc- 
tion for parton level predictions. For a jet of radius 0.7, the underlying event correction 
is larger, so the correction for the parton level prediction is positive. The total correc- 
tion is of the order of 7% for the lowest transverse momentum values in the inclusive 
jet cross-section measurement, decreasing rapidly to less than 1% at higher pr val- 
ues (falling roughly as 1/p7., as would be expected for such power corrections). The 
correction is roughly independent of rapidity. For a jet cone radius of 0.4, the fragmen- 
tation correction is somewhat larger (increasing as 1/R for small R) but the underlying 
event correction scales by the ratio of the cone areas (R?) [432]; as a result the two 
effects basically cancel each other out over the full transverse momentum range at the 
TEVATRON. 

Note that these two corrections deal with non-perturbative physics only. The as- 
sumption for the comparison to a NLO parton-only prediction is that the perturbative 
aspect of the jet shape is reasonably well-described by one gluon (in the NLO cal- 
culation) as with the parton shower (in the Monte Carlo). Thus, the fragmentation 
corrections determined for the latter can be applied to the former. Studies of the jet 
shape at NNLO should prove useful in testing this assumption. 


8.3.2 CDF inclusive jet results 


The inclusive jet cross-section measured by the CDF Collaboration in Run II in the 
central rapidity region using the Midpoint cone algorithm is shown in Fig. 8.12, as a 
function of the jet transverse momentum [116]. Due to the higher statistics compared 
to Run I, and the higher centre-of-mass energy, the reach in transverse momentum 
for the inclusive jet cross-section increased by approximately 150 GeV. The CDF 
measurement used the midpoint cone algorithm with a cone radius of 0.7. As discussed 
earlier in this section, the Midpoint algorithm places additional seeds (directions for 
jet cones) between stable cones having a separation of less than twice the size of the 
clustering cones. The Midpoint algorithm uses four-vector kinematics for clustering 
individual partons, particles or energies in calorimeter towers, and jets are described 
using rapidity (y) and transverse momentum (pr). 

A comparison of the inclusive jet cross-section measured by CDF in Run H with 
the Midpoint cone algorithm, to NLO QCD predictions using the EKS [509] program 
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Fig. 8.12 The inclusive jet cross-section from CDF in Run II. Reprinted 
with permission from Ref. [116]. 


with the CTEQ6.1 and MRST2004 PDFs, is shown in Fig. 8.13 for the five rapidity 
regions in the analysis [61]. l 

A renormalization and factorization scale of pł“ /2 has been used in the calculation. 
Typically, this leads to the highest predictions for inclusive jet cross-sections at the 
TEVATRON (for R=0.7), as discussed in Section 4.1.7 There is good agreement with 
the CTEQ6.1 predictions over the transverse momentum range of the prediction, in 
all rapidity regions. The MRST2004 predictions are slightly higher at lower pr and 
slightly lower at higher pr, but still in good overall agreement. 

As noted before, the CTEQ6.1 and MRST2004 PDFs have a higher gluon at large 
x as compared to previous PDFs, due to the influence of the Run I jet data from 
CDF and DØ. This enhanced gluon provides a good agreement with the high pr CDF 
Run I measurement as well. The D@ inclusive jet data taken in Run II, however, do 
not favour this higher gluon at large x, but instead prefer a weaker gluon. As will be 
observed in the next chapter, the inclusive jet data from the LHC do not yet provide 
a definitive answer. The curves indicate the PDF uncertainty for the prediction using 
the CTEQ6.1 PDF error set. The shaded band indicates the experimental systematic 
uncertainty, which is dominated by the uncertainty in the jet energy scale (on the order 
of 3%). It is important to note that for much of the kinematic range, the experimental 


TFor smaller jet sizes, a central scale of pit is perhaps more appropriate, and that is what is 
generally used in PDF fits currently. 
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Fig. 8.13 The inclusive jet cross-section from CDF in Run II, for several 
rapidity intervals using the Midpoint cone algorithm, compared on a linear 
scale to NLO theoretical predictions using CTEQ6.1 PDFs. Reprinted with 
permission from Ref. [61]. 


systematic errors are less than PDF uncertainties; thus, the use of this data has proven 
to be useful in global PDF fits. 


8.3.3 Jet algorithms and data 


The experimental jet cross-sections have also been measured in CDF using the kr 
algorithm with the same data sample and kinematic region as for the Midpoint anal- 
ysis [119]. See Fig. 8.14 for a comparison of the ratios of the two algorithms in data 
and in theory. Similarly good agreement to that obtained for the Midpoint cone algo- 
rithm is observed. This is an important observation. The two different jet algorithms 
have different strengths and weaknesses and it is useful to have data comparisons 
to both. However, this is one of the few cross-sections at the TEVATRON for which 
this comparison was performed. As will be observed at the LHC, the use of the anti-kr 
cross-section alone has become fairly universal, and there are few comparisons to other 
jet algorithms. The former point is good, as the anti-kr jet algorithm is arguably the 
best on the market. The latter is bad, as different jet algorithms (as well as different 
jet sizes) can spotlight different aspects of the underlying physics. 

It was noted in Section 2.1.6 that on general principles, for NLO parton level 
predictions, the cone jet cross-section is larger than the kr jet cross-section when 
Reone = D. At the hadron level, this is no longer true; the cone jet loses energy by the 
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Fig. 8.14 The ratios of the inclusive jet cross-sections measured with 
the kr algorithm (with D = 0.7) to those measured with the Midpoint 
algorithm (with R = 0.7) from CDF in Run II, for several rapidity intervals, 
with comparisons to the predictions of a NLO fixed-order QCD calculation 
and from PYTHIA. The results are from Ref. [61]. 


splash-out effect while the kr algorithm has a tendency to vacuum up contributions 
from the underlying event. This will be corrected at least partially by the hadron to 
parton level corrections for each algorithm. 

A particular complexity with the cone algorithm occurs when two jets overlap; a 
decision must be made whether to merge the two jets into one, or to separate them. 
This is an experimental decision; in CDF, the two overlapping jets are merged when 
more than 75% of the smaller jet energy overlaps with the larger jet. When the overlap 
is less, the towers are assigned to the nearest jet. DØ uses a criterion of a 50% fraction. 
NLO theory is agnostic on the subject as there is no overlap between the two partons 
that can comprise a jet. This point has become moot at the LHC in that cone algorithms 
are rarely used. 

Another problem that can arise on the particle or calorimeter level for a cone 
jet, but not on the NLO parton level, occurs when particles or calorimeter towers 
remain unclustered in any jet, due to the strong attraction of a nearby larger jet 
peak that will attract away any trial jet cone placed at the location of the original 
particles/calorimeter towers. The result will be what [507] calls dark towers, i.e. 
clusters that have a transverse momentum large enough to be designated either a 
separate jet or to be included in an existing nearby jet, but which are not clustered 
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into either. This is a feature endemic to any cone algorithm (including SISCone), but 
not to the kr family of jet algorithms. Thus, this is another advantage for the use of 
the anti-ky algorithm at the LHC. 

The TeV4LHC workshop writeup [133] recommended the following solution to the 
problem of unclustered energy with cone jet algorithms. The standard midpoint algo- 
rithm should be applied to the list of calorimeter towers/particle/partons, including 
the full split/merge procedure. The resulting identified jets are then referred to as 
first pass jets and their towers/particles/partons are removed from the list. The same 
algorithm is then applied to the remaining unclustered energy and any jets that result 
are referred to as second pass jets. There are various possibilities for making use of the 
second pass jets. They can be kept as separate jets, in addition to the first pass jets, 
or they can be merged with the nearest first pass jets. The simplest solution, until 
further study, is to keep the second pass jets as separate jets. 

It was originally thought that with the addition of a midpoint seed, the value of 
Rsep used with the NLO theory could be returned to its natural value of 2.0 (cf. 
Section 2.1.6). Now it is realized that the effects of parton showering /hadronization 
result in the midpoint solution virtually always being lost. Thus, a value of Rsep of 
1.3 (for split/merge fraction f=0.75) is required for the NLO jet algorithm to best 
model the experimental one. The inclusive jet theory cross-section with Rsep = 1.3 is 
approximately 3 — 5% smaller than with Rsep = 2.0, decreasing slowly with the jet 
transverse momentum. 


8.3.4 Inclusive jet production at the TEVATRON and global PDF fits 


Inclusive jet production receives contributions from gg, gq, and qq(qq) initial states 
as shown in Fig. 4.1. Thus, in principle, this process is sensitive to the nature of all 
the PDFs. The experimental precision of the measurement, along with the remaining 
theoretical uncertainties, means that the cross-sections do not serve as a meaningful 
constraint on the quark or antiquark distributions. However, they do serve as an 
important source of information on the gluon distribution, especially at high x. The 
addition of the jet data from CDF and D@ resulted in a larger gluon distribution at 
high z than present in PDFs determined without the TEVATRON jet data. The influence 
of the high Er Run I jet cross-section on the high x gluon is evident. There is always 
the danger of sweeping new physics under the rug of PDF uncertainties. Thus, it is 
important to measure the inclusive jet cross-section over as wide a kinematic range as 
possible, as was done by DØ in Run I [103] and by CDF [61] and DØ [86] in Run 11. 
The generic expectation is that most signals of new physics would tend to be central 
while a PDF explanation should be universal, i.e. fit the data in all regions. 

As inclusive jet production probes high scales, it serves as a useful observable 
to search for the presence of quark compositeness. Fig. 8.15 compares the D@ jet 
cross-sections measured in Run I to the NLO QCD predictions using the CTEQ6.1 
PDFs, along with the cross-sections for jet production including a four-Fermi contact 
interaction (as discussed in Section 4.1). The mass scale of the contact interaction, A 
(cf. Eq. (4.18)), is probed at three values (1.6, 2.0 and 2.4 TeV), assuming constructive 
interference [867]. The cross-section is plotted as a ratio to the pure QCD prediction. 
The effect of the contact term is limited to the central rapidity regions (with of course 
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curves, from top to bottom). Reprinted with permission from Ref. [867]. 


the size of the effect decreasing with increasing mass scale). This DØ data was used in 
the determination of the CTEQ6.1 PDFs. If there were a contact interaction, then the 
PDFs would need to be refit, comparing the data to the theory with compositeness 
included. 


8.4 Inclusive photon and diphoton production 


Measurements of single and double photon production also serve as precision tests of 
perturbative QCD, while avoiding the subtleties of jet definition, and were extensively 
studied at the TEVATRON [63, 67, 82, 90, 91, 93, 102, 110, 120]. Due to the presence 
of the electromagnetic coupling a, and the more limited number of subprocesses, the 
rate is suppressed with respect to jet production. In fact, photon measurements suffer 
from backgrounds due to the rare jets in which a large fraction of the momentum of 
the jet is taken by one or more 7° (or 7) mesons. Each meson decays into two photons 
that typically can not be resolved due to the finite granularity of the calorimeter. Such 
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Fig. 8.16 (left) The energy distribution in an isolation cone about the 
photon direction. Shown are the contributions from true photons and from 
backgrounds. (right) The resultant photon signal fraction as a function of 
photon transverse energy. Reprinted with permission from Ref. [58]. 


events fall on the tail of the (rapidly falling) jet fragmentation function, but the large 
rate of jet production means that the background can not be ignored. 

To reduce the jet backgrounds, the photons are typically required to be isolated, 
loosely at the trigger level, and more tightly off-line, similar to what is done for elec- 
trons. For example, in the CDF measurements in Run II, a requirement is made that 
the additional energy in a cone of radius R = 0.4 about the photon direction is less 
than 2 GeV, once the pileup energy from additional minimum bias events has been 
subtracted. An isolation requirement reduces not only jet backgrounds, but also contri- 
butions from photon fragmentation functions. This isolation is tighter than the typical 
isolation cuts applied at the LHC. 

An example of a photon isolation distribution is shown in Fig. 8.16 (left) [58]. The 
negative energy tail is a result of the pileup subtraction. The photon fraction in each 
kinematic bin can be determined using templates of the isolation energy for photon 
candidates and backgrounds for each kinematic bin. The resulting true photon fraction 
is shown in Fig. 8.16 (right). The photon fraction of the sample rises rapidly towards 
one as the photon candidate transverse momentum increases, as expected, since the 
fixed isolation cut requires the fraction z of the jet momentum taken by the leading 7° 
also increases. This is a general rule-of-thumb for both the TEVATRON and the LHC: 
an isolated photon-like object is almost always a real photon. 

The resulting cross-section is shown in Fig. 8.17 [58]. Good agreement is observed 
with the NLO prediction from JETPHOX, except perhaps at low Er, where the data 
is higher than theory. This has been observed in several other TEVATRON photon 
measurements, and has been attributed to the effects of soft gluon radiation. However, 
as will be seen in the next chapter, this has not been seen for similar measurements 
at the LHC. Note that the cone isolation cut greatly suppresses the fragmentation 
contribution to photon production, but does not explicitly remove it. 

The photon + jet cross-section is interesting in its own right and, as discussed 
already in Section 4.2, as an input to PDF fits. It can also be used for a calibration 
of the jet energy scale. The electron energy scale is known very precisely from mea- 
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Fig. 8.17 The isolated single photon cross-section measured in CDF com- 
pared to NLO QCD predictions from JETPHOX [348]. Reprinted with per- 
mission from Ref. [58]. 


surements of Z — e*te~ [95]. This energy scale can be transferred to photons, taking 
into account the differences in electromagnetic showers between photons and electrons, 
and is limited primarily by an incomplete knowledge of the material in front of the 
calorimeter. In DØ, for example, a tight selection is made on photon candidates to 
reject as much background from jet fragmentation as possible [98]. A requirement is 
made that no other jets be present in the event, that no additional minimum bias 
events (pileup) be present, and that the photon and the jet be back-to-back (Ad > 2.9 
radians). The jet energy response is then corrected by looking at the balance in pr 
between the photon and the jet using the missing transverse energy projection fraction 
method [98]. Corrections are also made for the presence of underlying event energy, 
and for gluon radiation outside of the jet cone. The resulting uncertainty on the jet 
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Fig. 8.18 (left) The sources for uncertainty of the jet pr response in the 
central rapidity region. Here, the variable E = pr,y cosh njet is used on 
the horizontal axis, as the resolution is better than for jet energy. (right) 
The resultant jet energy scale uncertainty, as a function of jet transverse 
momentum, for the central rapidity region. Reprinted with permission from 
Ref. [98]. 


pr response is shown in Fig. 8.18 (left) and is dominated by the photon energy scale 
error. The method is restricted to the central rapidity region and is limited by the 
statistics of the photon + jet sample to jets below 350 GeV. The energy scale can 
be transferred to non-central rapidities using transverse momentum balancing in dijet 
events. The energy scale for jets above that transverse momentum must be extrap- 
olated. The final fractional uncertainty on the jet energy scale is shown in Fig. 8.18 
(right) for the central rapidity region. 

Diphoton production has also been extensively studied at the TEVATRON. In CDF, 
for example, a study was carried out using the full Run II data sample, using the 
same cuts and analysis techniques as for single photon production [67].8 In general, 
the phenomenology is richer than for single photons. For example, the transverse 
momentum cuts on the photons sculpt the diphoton pr spectrum, giving rise to a 
bump in the spectrum often referred to as the Guillet shoulder [254, 255]. This can be 
seen in Fig. 8.19, which shows a CDF measurement of the diphoton pr distribution [67], 
with the shoulder around 35 GeV. This feature is due to configurations in which the 
photon pair is accompanied by significant additional hard radiation. 

As pointed out in Section 4.2.3, in particular in Table 4.1, this distribution is 
difficult to describe with the first few orders of a fixed-order calculation. An NLO cal- 
culation of the total diphoton cross-section, corresponding to the MCFM prediction in 
Fig. 8.19, gives the first non-trivial prediction. Since the diphoton pair recoils against a 
single parton, this calculation is insufficient to capture the shoulder and the description 
of the data is relatively poor. In the NNLO calculation, which accesses configurations 
with two recoiling partons for the first time, the shoulder begins to be reproduced 
and the theoretical description is much improved. The SHERPA prediction, which here 


8Diphoton production has backgrounds from either one or both photons being faked by jets. Again, 
as the Er of the photons increases, the purity fraction increases. 
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Fig. 8.19 The diphoton pr distribution compared to theoretical predic- 
tions that are accurate for the total cross-section at NNLO QCD [340], 
at NLO QCD using Mcrm [311], and to SHERPA [583]. Reprinted with 
permission from Ref. [67]. 


contains LO matrix elements for multiple parton emission, is similarly well-suited to 
describe the data. The same events are also responsible for an excess of events, with 
respect to pure NLO predictions, at low diphoton mass, and at low Ag. 


8.5 Vector boson plus jet physics 


As discussed in Section 4.3, measurements of vector boson production in association 
with jets serve as a precision test of perturbative QCD over a wide dynamic range, in 
the presence of a large mass scale (myz). The process also serves as a background 
to tt production (for the W — lv + jets final state), as well as to possible new physics 
signals. Typically, a smaller jet size is preferred for final states that may be complicated 
by the presence of a large number of jets, so in most cases at the TEVATRON a jet size 
of R = 0.4 was used for measurements of this process. This process was well-studied 
at the TEVATRON by both CDF and DØ [59, 60, 85, 88, 96, 99]. 

For example in Fig. 8.20 (left), the W boson pr is shown for several different jet 
multiplicities and compared to predictions at LO and NLO in QCD. Good agreement 
is observed for the BLACKHAT+SHERPA NLO prediction. Here, the jet transverse mo- 
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mentum cut is 20 GeV.° Below a W boson pr of 20 GeV the W boson recoils against 
more than one hard jet, and the effects of soft gluon emission also become important. 

The Hr distribution (the sum of the transverse momenta of all jets and leptons 
(including neutrinos) is plotted in Fig. 8.20 (right) as a function of the jet multiplicity. 
The Hr variable is particularly sensitive to higher order QCD effects, but is also 
often chosen as a variable in which to look for the presence of BSM physics. There 
is a significant variation observed in the level of agreement between the data and the 
predictions evident in the figure, possibly allowing for the possibility of improvements 
to these predictions. Note in particular the tendency of the NLO BLACKHAT+SHERPA 
predictions to lie below the data at high Hry for the > 1 jet bin. We will return to this 
observation in Chapter 9. 

Note that the DØ measurements were carried out with the midpoint Cone jet 
algorithm, but the results were corrected to the (infra-red safe) SISCone jet algorithm 
using SHERPA. For the leading jet transverse momentum, this correction is very small 
as the two algorithms are very close in behaviour. 

A measurement of Z+jets production was carried out at CDF using the full data 
sample [76]. The jet multiplicity distribution is shown in Fig. 8.21 for data com- 
pared to BLACKHAT+SHERPA (left), and ALPGEN +PYTHIA, POWHEG +PYTHIA and 
LoopSim+McrM (right). Recall that this last prediction is obtained by ameliorating 
an exact NLO calculation with an approximate treatment of NNLO effects, according 
to the procedure described in Section 3.4.2. The midpoint jet algorithm with R = 0.7 
was used (in contrast to the other V+jets measurements which used R = 0.4). It is no- 
ticeable that the measured cross-section for Z+ > 3 jets is approximately 30% larger 
than the BLACKHAT+SHERPA prediction, contrary to what has been observed at the 
LHC using the anti-kp jet algorithm with R = 0.419 and with W+ > 3 jets (using 
R = 0.5) at the TEVATRON [96]. The Monte Carlo predictions are in better agreement 
with the high jet multiplicity data, albeit with larger uncertainties. Looking back at 
Fig. 4.22, the cross-section predictions for the SISCone jet algorithm (for > 4 jets, but 
the situation is roughly similar for > 3 jets) tend to peak at smaller scales than do the 
predictions for the anti-kr jet algorithm. The differences increase as the jet multiplicity 
and the jet size increases. The peak cross-section values for the TEVATRON measure- 
ment are actually quite similar for the two algorithms, as discussed in Section 4.3 [227]. 
The Z+ > 3 jet cross-section was not determined using the anti-kr jet algorithm in 
the CDF measurement, but a comparison was carried out using simulated data. The 
resulting cross-sections, for the two jet algorithms, are much closer than implied by 
the BLACKHAT+SHERPA predictions using a scale of Hy /2. 


8.6 ti production at the TEVATRON 


The greatest discovery at the TEVATRON was that of the top quark [81, 112]. The 
production mechanism was through tt pair production, dominated by a qq initial state. 
The top quark decays essentially 100% into a W and a b quark; thus the final states 


9 As we will see in Chapter LHC, the jet cuts at the LHC are typically higher than 20 GeV, due to 
increased backgrounds from pileup and the underlying event. 

10 Although the cross-section for Z+ > 4 jets has been calculated for the LHC, it has not been 
calculated for the TEVATRON. Thus, there are no BLACKHAT+SHERPA predictions for that final state. 
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Fig. 8.20 The W boson transverse momentum (left) and Hr (right) dis- 
tributions in inclusive W + n-jet events, for 1 < n < 4, measured by DØ. 
Reprinted with permission from Ref. [99]. The measurements are compared 
to several theoretical predictions. 


being investigated depend on the decays of the two W’s. The most useful (combination 
of rate and background) final state occurs when one of the W’s decays into a lepton 
and neutrino and the other decays into two quarks. Thus, the final state consists of 
a lepton, missing transverse energy and of the order of four jets. The number of jets 
may be less than 4 due to one or more of the jets not satisfying the kinematic cuts, or 
more than 4 due to additional jets being created by gluon radiation off the initial or 
final state. Because of the relatively large number of jets, a smaller cone size (R = 0.4) 
has been used for jet reconstruction, with the CDF analyses in Run 2 using the JetClu 
cone algorithm and the DØ analyses the Midpoint cone algorithm. No top analysis 
has been performed using the kr jet algorithm. There is a sizeable background for 
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Fig. 8.21 The jet multiplicity distribution for data compared to 
BLACKHAT+SHERPA (left), and ALPGEN +PYTHIA, POWHEG +PYTHIA and 
LoopSim+MCFM (right). Reprinted with permission from Ref. [76]. 


this final state through QCD production of W+ jets. Two of the jets in tt events are 
created by b quarks; thus there is the additional possibility of an improvement of signal 
purity by the requirement of one or two b-tags. 

Top pair events also have a harder Hr (sum of the transverse energies of all jets, 
leptons and missing transverse energy in the event) than does the W + jets back- 
ground. This is due to the harder spectrum of the jets from tt decays (compared to 
the background), resulting from the large mass scales inherent in top production. A 
requirement of large Hr thus improves the tt purity. 

The jet multiplicity distribution for the top candidate sample from CDF in Run II 
is shown in Fig. 8.22 for the case of one of the jets being tagged as a b-jet (left) and two 
of the jets being tagged (right) [118]. The requirement of one or more b-tags greatly 
reduces the W+ jets background in the 3 and 4 jet bins, albeit with a reduction in 
the number of events due to the tagging efficiency. The b-tagging efficiency at the 
TEVATRON was typically of the order of 40%. As will be seen in the next chapter, the 
b-tagging efficiency is higher at the LHC, primarily due to a larger rapidity coverage 
for the silicon detectors. The high jet multiplicity double-b-tagged events are almost 
exclusively tt events. 

The top pair cross-section has been measured in a variety of final states (depending 
on the W boson decays) with a variety of techniques. A compilation of measurement 
results from CDF and DØ is shown in Fig. 8.23 [70]. The cross-sections from the two 
experiments agree with each other and with the theoretical predictions. 
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Fig. 8.22 The expected number of W+ jets events that are b-jet tagged 
(left) and double-b-tagged (right), indicated by source. Reprinted with per- 
mission from Ref. [118]. 
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Fig. 8.23 A compilation of cross-section measurements for tt final states 
from CDF and DØ at the TEVATRON. Reprinted with permission from 
Ref. [70]. 


8.6.1 Measuring the top mass 


The lepton + jets final state (with one or more b-tags) is also the most useful one for 
the determination of the top mass (although there is top mass information in all of 
the final states). This final state optimizes the most information as to the top mass, 
with the least background from non-top processes. For example, the final state with 
two high-pr leptons, large missing transverse energy, and two (b) jets has the least 
background, but the presence of two neutrinos makes the reconstruction of the final- 
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state kinematics difficult. For the final state with 6 jets (2 b-jets), complete kinematic 
information is available; however, the QCD background from six-jet production is very 
high. 

The high statistics for top production accumulated at the TEVATRON has allowed 
cross-checks of the calibration techniques for the top mass reconstruction, as for exam- 
ple the calibration of the light quark jet response using the decay of the hadronically 
decaying W boson (in té events) into light quarks. 

There are two main techniques at the TEVATRON for top mass determination: 
the template method and the matrix element method. In the template method, the 
information from the lepton + jets final state is input to a x? determination, where the 
reconstructed top mass is a free parameter. The x? is minimized for each possible way 
of assigning the 4-vector information for each of the four leading jets to the top decay 
products (if any jets are b-tagged, then they are required to be from the top decay and 
not from the W boson decay). The x? expression has terms for the uncertainty on the 
measurements of the 4-vectors of the decay products. There are two possible solutions 
for the longitudinal momentum of the neutrino, so both are used. The minimum chi- 
square solution (for jet assignment, neutrino solution and Mop) is chosen for each 
event. 

The matrix element method has the ability to use theoretical information from 
the matrix element, retaining all of the hard scattering correlations, in the top mass 
determination. A likelihood is determined for each event that the theoretical model 
from the matrix element describes the kinematics of the event. The technique is very 
CPU-intensive, and until recently was restricted to the use of leading order matrix 
elements. In [318], the matrix element method was extended to allow the calculation 
of next-to-leading order weights on an event-by-event basis. 

In either method, the determination of the top mass is obtained by comparing data 
with Monte Carlo predictions. Thus, the top mass can not be strictly identified with 
any precise theoretical definition, such as the MS mass or the pole mass discussed in 
Section 4.5. However, the differences should be smaller than the current uncertainties 
on the top mass from the TEVATRON measurements, but may be an issue for future 
more precise determinations at the LHC. 

A compilation of top mass determinations from the TEVATRON, with the global 
mass fit dominated by lepton+jets final states, is shown in Fig. 8.24 [599]. The top 
mass has been determined from the TEVATRON measurements to 0.64 GeV, a precision 
of about 0.4%. All of the individual determinations of the top mass are consistent with 
each other. These determinations of the top quark mass, together with the measured 
W-boson mass, provide an indirect constraint on the mass of the Higgs boson. This is 
shown in Fig. 8.25 [599]. 

The precision of the top mass determination at TEVATRON has reached the point 
where some of the systematics due to QCD effects must be considered with greater 
care. One of the potentially important systematics is that due to the effects of initial- 
state radiation. Jets created by initial state radiation may replace one or more of the 
jets from the top quark decays, affecting the reconstructed top mass. In the past, 
the initial state radiation (ISR) systematics was determined by turning the radiation 
off/on, leading to a relatively large impact. A more sophisticated and more correct 
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Fig. 8.24 A compilation of the top quark mass measurements from CDF 
and DØ, from arXiv:1608.01881. 


treatment was adopted in Run II, where the tunings for the parton shower Monte 
Carlos were modified leading to more/less initial-state radiation, in keeping with the 
uncertainties associated with Drell-Yan measurements as discussed in Section 8.5. The 
resultant tt pair transverse momentum distributions are shown above in Fig. 8.26. The 
changes to the tt transverse momentum distribution created by the tunes are relatively 
modest, as is the resultant systematic error on the top mass determination. 

Note that the peak of the tt transverse momentum spectrum is somewhat larger 
than that for Z production at the TEVATRON, due to the larger mass of the tt system. 
As both are produced primarily by q@ initial states, the difference is not as large as it 
is at the LHC, where the primary tt production mechanism is through gg fusion. 

It is also interesting to look at the mass distribution of the tt system, as new physics 
(such as a Z’ [622]) might couple preferentially to top quarks. Such a comparison for 
Run II is shown in Fig. 8.27 without any signs for a high mass resonance [69]. The 
simulation of the Standard Model tt signal for this analysis was carried out with 
POWHEG using MSTW2008 NLO PDFs. Often, in previous TEVATRON studies, a LO 
Monte Carlo such as PYTHIA was used, along with a LO PDF. Note that if we compare 
the predictions for the tt mass distribution at LO and NLO, we see that the NLO cross- 
section is substantially less than the LO one at high mass. Further investigation shows 
that the decrease of NLO compared to LO at high mass is found only in the qq initial 
state and not in the gg initial state. In fact, at the TEVATRON, the ratio of NLO to 
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Fig. 8.25 Implications of the measured top quark and W-boson masses 
for the mass of the Higgs boson, from ref. [599]. 
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Fig. 8.26 The PYTHIA predictions for the tt transverse momentum using 
the Plus/Minus tunes. We thank Prof. Un-Ki Yang for this figure. 


LO for gg initial states grows dramatically with increasing top pair invariant mass. 
This effect is largely due to the increase in the gluon distribution when going from 
CTEQ6L1 in the LO calculation to CTEQ6M at NLO. For instance, at x ~ 0.4 (and 
hence an invariant mass of about 800 GeV) the gluon distribution is about a factor two 
larger in CTEQ6M than in CTEQ6L1, giving a factor four increase in the cross-section. 
Conversely, the quark distribution is slightly decreased at such large x. If NLO PDFs 
were used for both the numerator (NLO) and denominator (LO), this dramatic effect 
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Fig. 8.27 The tt mass distribution measured by CDF in Run II. Reprinted 
with permission from Ref. [69]. 


would not exist. This is an example of the danger if LO PDFs are used, especially in 
discovery physics regions. 

In any case, the absolute contribution of the tt cross at high masses from gg initial 
states at the TEVATRON is small, due to the rapidly falling gluon distribution at high 
x. The dominant tt production mechanism at the LHC in all mass regions is through 
gg fusion. 


8.6.2 The forward-backward asymmetry 


Given the symmetric initial and final states for the reaction pp — tt, it would be 
natural to expect that the angular distribution of the t and the t would be symmetric 
around rapidity y = 0. This is certainly true in a leading order QCD matrix element 
calculation. However, as discussed at length in Section 4.5.3, this is no longer true at 
NLO. The net production asymmetry at NLO in QCD, including EW effects which 
increase the asymmetry by a factor of 1.26 [702, 747], is of the order of 7% at the 
TEVATRON. The larger values for the asymmetry (compared to theory) measured by 
CDF [68, 71], and DØ [83, 100], have resulted in a number of new calculations of 
both SM effects as well as possible BSM contributions to the (larger than expected) 
asymmetry. Given the large mass of the top quark, it seems a likely place to look for 
the presence of new physics. The recently calculated NNLO QCD asymmetry increases 
the inclusive asymmetry prediction by a factor of 1.3 [428], in better agreement with 
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Fig. 8.28 The inclusive tt forward-backward asymmetry at the TEVATRON 
is shown for CDF and DØ, compared to theoretical predictions (QCD and 
QCD+EW) using two different definitions for the ratio, as discussed in the 
text. Reprinted with permission from Ref. [428]. 


the measurements. The inclusive asymmetry values from experiment and the NNLO 
calculation are shown in Fig. 8.28. There is an ambiguity in the manner in which the 
asymmetry can be calculated, i.e. whether the exact results are used for the numerator 
and denominator, or whether the asymmetry ratio is expanded in terms of powers of 
a,(mz). In the figure, the exact results at each order are shown in capital letters, 
while the small letters refer to the expanded version of the calculation. The first four 
theory predictions in the figure are QCD-only while the second four are QCD+EW. 
The expanded version results in a larger value for the asymmetry with smaller errors. 
The authors of [428] prefer the exact result, so that is used in the figures below. 

The asymmetry typically is measured as a function of the three dynamical variables: 
AYiz, Mz, and prs The expectation is that at NLO in QCD the asymmetry should 
increase linearly with the variables AY,z and m,;,'! while it should switch signs at 
non-zero pr (the ISR/FSR radiation is the only contribution to the asymmetry away 
from pr zz = 0). 

At the TEVATRON, the asymmetry is not a subtle effect [68]. It is observable even 
at the raw data level, as shown in Fig. 8.29 (left). The effects of the background and 
of the non-perturbative effects are to dilute the net asymmetry, so after corrections 
for these effects, the asymmetry increases, as shown in Fig. 8.29 (right). Note that the 
parton level asymmetry is larger than the background-subtracted asymmetry, which 
is larger than the asymmetry at the reconstructed level.) 

Comparisons of the parton-level NLO and NNLO QCD (no EW) asymmetry pre- 
dictions to the CDF and DØ measurements for AY;; (left) and m,;(right) are shown 


11The asymmetry is primarily proportional to 8, the velocity of the top (or anti-top) in the tt 
centre-of-mass frame [141]. 
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Fig. 8.29 (left) The Ay distribution at the reconstruction level, before 
any background subtraction or unfolding, compared to predictions from 
signal and background models. (right) A comparison of the asymmetry as a 
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Fig. 8.30 A comparison of the pure QCD (NLO and NNLO) tt asym- 
metry predictions to the data from CDF and DØ, as a function of (left) 
the rapidity separation between the t and the t, and (right) the tt mass 
distribution. Reprinted with permission from Ref. [428]. 


in Fig. 8.30, from [428]. The CDF results tend to be higher than those from DØ, but 
both are in statistical agreement. Note that many physics models that could increase 
the asymmetry at high m,z should also cause a change in the observed tt mass spec- 
trum. However, as shown in Fig. 8.27, no such deviation is observed, and the mass 
distribution agrees with NLO predictions. 

Predictions at NLO and NNLO (pure) QCD for the asymmetry as a function of 
the transverse momentum of the tt pair are shown in Fig. 8.31 (left). The NNLO 
corrections greatly decrease the size of the (negative) asymmetry for non-zero p% 
values, leading to the net increase in the inclusive asymmetry discussed previously. The 
CDF data are shown in Fig. 8.31 (right), compared to predictions from POWHEG and 
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Fig. 8.31 (left) Predictions for the QCD asymmetry as a function of the 
tt transverse momentum at NLO and NNLO. Reprinted with permission 
from Ref. [428]; (right) the CDF tt asymmetry distribution as a function 
of the transverse momentum of the tt pair, compared to predictions from 
POWHEG and PYTHIA. Reprinted with permission from Ref. [68]. 


PYTHIA. The asymmetry does decrease with increasing transverse momentum, as do 
the predictions, but there is not good agreement. It is interesting that the slope of the 
PYTHIA prediction seems similar to that observed in the data. It might at first glance 
seem surprising that PYTHIA, using LO QCD matrix elements, predicts any inclusive 
asymmetry at all. The parton shower will create additional jets, but as they result 
from the shower, and not from a matrix element, there is no net ISR/FSR interference 
as discussed above. However, as discussed in Section 4.5.3, there is a greater likelihood 
for gluons to be emitted if the colour charges are accelerated. As the production of the 
jet is more likely if the top (t) is produced in the backward (forward) direction, the 
net asymmetry for the ttj final state is negative. Also, the result of the gluon emission 
is that the tt system moves from pr = 0 to a higher transverse momentum, leaving 
the events where no radiation has taken place (near or at pr = 0), with a positive 
asymmetry, since the sum of the two asymmetries for the leading order prediction 
must be zero. Detailed investigations of colour coherence effects and of the treatment 
of recoils in the parton showering have shown that the asymmetry as a function of the 
transverse momentum of the tt pair can in addition be substantially affected by the 
details of the treatment of those effects [627, 859]. 

Unfortunately, the study of the tt asymmetry at the TEVATRON has ended, and 
similar investigations at the LHC are much more difficult due to the dilution of the effect 
(the gg initial state dominates, and the initial hadrons are both protons). There is no 
evidence for the presence of any BSM physics at either the TEVATRON or the LHC and 
improved calculations of QCD and EW effects have lead to better improvements of the 
data with the theory. A complete understanding, though, may require the combination 
of investigations at both the NNLO and the parton shower levels. This observable 
seemed to be a clear sign of possible new physics, but instead showed the complexity 
and subtlety of QCD+EW predictions in a hadron-hadron environment. The Standard 
Model once again appears to rule. 
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8.6.3 Single top production 


Single top production was first observed at the TEVATRON [73]. As discussed in Sec- 
tion 4.6, the measurement of single top production provides a sensitivity to the value 
of the CKM matrix element V, and thus a sensitivity to a variety of possible new 
physics [149, 357, 870]. There are three production channels, s-channel, t-channel, and 
tW production, with the first two dominating at the TEVATRON. Closer inspection 
reveals that the latter channel tW-associated production is the part of a larger set of 
Feynman diagrams at leading order, which lead to a bbW t W7 final state. Clearly, one 
could try to differentiate three kinematic regions, by allowing zero, one, or two top 
quarks becoming resonant, with a corresponding peak in the bW mass distribution — 
their decay products. In addition to this complication, there are also ambiguities in the 
definition of s-channel and t-channel production; indeed as discussed in Section 4.6, 
there is no distinction between them at higher orders in QCD. However, t-channel 
events tend to produce light flavour jets at high rapidity, while s-channel events are 
more likely to contain two b-jets in the central rapidity region. The jet—7 distribution 
in t-channel production is strongly peaked in the forward direction due to the larger 
momentum that the incident valence quark participating in the t-channel process has 
compared to the incident gluon (the jet 7 direction for the spectator jet accompanying 
single t production would be peaked in the backward direction). The order a; cor- 
rections shift the spectator jet to even more forward 7, due to the impact of gluon 
radiation. 

There are significant backgrounds to both channels, from W/Z+jets, dibosons, tt, 
and even Higgs boson production. The W boson + jets normalization is taken from 
data (in regions which are not likely to contain single-top events; the shape is taken 
from the NLO prediction), while the cross-sections for the other background processes 
are taken from their theoretical predictions. 

There is no kinematic variable that allows for a distinct separation between single 
top production and its backgrounds. Thus, multivariate discriminants are used to 
optimize the separation. The distribution of the discriminant is shown in Fig. 8.32 
(left), for the combined CDF +D@ combination, with a t-channel signal being evident 
at large negative values of the discriminant and an s-channel signal being evident at 
large positive values [73]. The measured single top cross sections for CDF, DØ and 
for the combination are shown in Fig. 8.32 (right). The results are in good agreement 
with Standard Model predictions [73], and allow a determination of |V;,| = 1.02308. 
At 95% CL, |Vij| > 0.92. 


8.7 Higgs boson searches 


The most challenging analysis of the TEVATRON involved the search for the Higgs 
boson. The relatively low signal cross section and the large backgrounds required (1) 
a large integrated luminosity, (2) the use of multivariate analysis techniques, (3) the 
combination of a large number of Higgs boson production and decay channels, and (4) 
the combination of results from the CDF and D@ experiments. The final results [66] 
involved the complete TEVATRON data sample (approximately 10 fb~+), the gg fusion, 
associated production (VH), vector boson fusion and ttH production channels, and 
the bb, WtW-, rtr, yy and ZZ decay modes. With the multivariate analyses, it 
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Fig. 8.32 (left) The distribution of the discriminant used for the sepa- 
ration of single top production and its backgrounds. The black solid line 
shows the total background. (right) The measured single top production 
cross-sections from CDF and DØ, and the TEVATRON combination, com- 
pared to a prediction at NLO+NNLL. Reprinted with permission from 
Ref. [73]. 


was important to separate the event selections into orthogonal search regions, so that 
the multivariate analyses could be optimized for each region. For example, for the 
H — W+*W~ decay mode, where both W bosons decay leptonically, it was useful for 
the searches to be broken into 0,1 and > 2 jet final states. 

One of the backgrounds for the VH(— bb) searches is VZ(—> bb) production; 
however, this also serves as a useful calibration tool. The background-subtracted dis- 
tribution for the reconstructed dijet mass in VZ final states is shown in Fig. 8.33. The 
reconstructed cross section agrees well with the Standard Model prediction. 

The best-fit cross Higgs boson signal cross section as a function of Higgs boson mass 
for the final TEVATRON result is shown in Fig. 8.34 (left). A broad excess in the range 
of 120-140 GeV can be observed; also shown are the expectations for the production 
of a Higgs boson with either the Standard Model cross section, or the Standard Model 
cross section multiplied by a factor of 1.5. This excess has a significance of 3.0 standard 
deviations and can be associated with the production of a Higgs boson with a cross 
section that is a factor 1.44t9:59 times the SM prediction. The best-fit values for the 
cross sections from the four decay modes shown in Fig. 8.34 (right) are all consistent 
with each other, and with the SM predictions. 


8.8 Summary 


The TEVATRON was the first hadron-hadron machine in which modern techniques 
could be used for event reconstruction and analysis, and for comparison of data to 
theory. Jet algorithms were developed that allowed more precise theoretical compar- 
isons, although in most cases not with the full all-orders infrared-safety desired. Some, 
but not all, measurements were presented at the hadron level, with complete informa- 
tion about the parton-to-hadron corrections. Results at the hadron level allow theorists 
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Fig. 8.33 The background-subtracted data for the reconstructed dijet 
mass for the combined CDF and DØ measurements of VZ production. 
Reprinted with permission from Ref. [66]. 
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Fig. 8.34 (left)The best-fit signal cross sections, expressed as a ratio to 
the SM prediction, as a function of the Higgs boson mass, using the com- 
bined CDF and DØ data samples. (right) The best fit values (ø x Br) for the 
combined CDF and DØ Higgs boson search channels, for the yy, WWT, 


Tr and Vbb final states. Reprinted with permission from Ref. [66]. 


(or other experimentalists) to better compare their predictions against experimental 
data. An alternative commonly used in the experimental community was to compare 
data and theory at some intermediate level of reconstruction, accessible only to the 
experimentalists who carried out the measurement. Many measurements were carried 
out in fiducial regions, allowing the comparison of data to theory without extrapo- 
lations which may have model-dependence. As will be seen in the next chapter, this 
extrapolation can still cause problems at the LHC. 
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Advances in theoretical techniques, and in computing power, allowed the calcu- 
lation of processes with multi-jet final states at LO and NLO, often in the context 
of a parton shower Monte Carlo. With the new theoretical predictions, and the ac- 
cess to larger scales at the TEVATRON than previously accessible, the Standard Model 
was tested with great precision. Alas, although the TEVATRON was capable of dis- 
covering and of measuring the properties of the top quark, and of finding evidence 
for the existence of the Higgs boson, it completed its run without the discovery of 
new physics. Searches for new physics, and for further precision measurements of the 
standard model, had to await the LHC. 


9 
Data at the LHC 


This chapter on LHC results presents the culmination of the theoretical techniques de- 
veloped in the earlier chapters, along with the data analysis experiences from TEVATRON. 
Here, a wide range of exemplary data from Run I at 7 and 8 TeV will be discussed.! 
For the cross-sections discussed in this chapter, the data have been corrected for all 
experimental effects, so that effectively they are at the hadron level. Corrections for 
detector inefficiencies and resolution effects have been taken into account by an unfold- 
ing procedure. The theoretical predictions have been corrected for non-perturbative 
effects, either in a Monte Carlo framework, or by the addition of non-perturbative 
corrections to parton-level predictions. 

Note that the latter approach is often ignored by experimenters at the LHC, outside 
of the Standard Model groups, even though in many cases it offers the highest precision 
comparison. This is especially true if the cross section has been calculated to NNLO. 
If the cross-section is suitably inclusive, as for example the transverse momentum 
distribution for the leading jet in Higgs+> 1 jet events, resummation effects should be 
small, and a comparison of the data to a NLO or NNLO fixed order prediction should 
certainly be carried out.” 

Cross-sections for Standard Model processes at the LHC have been measured over 
14 orders of magnitude, as shown in Fig. 9.1 for the ATLAS experiment. In general, 
SM predictions are in remarkable agreement with data. It is a hallmark of the abilities 
of the human mind that the technology developed and the calculations made in the 
past decades prove to be so amazingly powerful and accurate. 

The only grain of salt in this success story is that up to now no clear sign of 
any physics Beyond the Standard Model (BSM) has shown up, despite a plethora of 
searches.? The higher energy (and higher integrated luminosity) for Run II holds the 
promise that a threshold for new physics may be reached. However, it is clear that the 
signatures for new physics may be subtle and a thorough understanding of pQCD is 


1At the time of the writing of this book, first measurements at 13 TeV were available. However, 
comparisons have been limited to data from the 7 and 8 TeV running. 

2For this example, the cross-section is not totally inclusive, in the sense that a pr requirement 
has been imposed on the jets, typically 30 GeV. But, the resulting restriction on the phase space for 
gluon emission is minimal and the effects of resummation thus are suitably small [198]. 

3Unfortunately, the 750 GeV bump in the diphoton mass spectrum appears to have gone away in 
the most recent 13 TeV data. 
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9.1 A 
tions measured by the ATLAS experiment in Run I, 


Fig. 


taken from 


atlas.web.cern.ch/Atlas/GROUPS/PHYSICS/CombinedSummaryPlots. 


Reprinted with permission from CERN. 
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necessary for such discoveries to take place. 

In this chapter, data for relatively inclusive strong interaction observables are pre- 
sented, including total cross-sections, particle spectra in Minimum Bias events, and 
the Underlying Event, which constitutes an important and non-negligible nuisance for 
nearly every LHC measurement, cf. Section 9.1. This is followed in Section 9.2 by a 
discussion of pure QCD production of jets, ranging from inclusive jet spectra to mul- 
tijet topologies, which also play a role in some of the searches for BSM physics. Next, 
a benchmark signature, namely Drell-Yan production, will be discussed in Section 9.3, 
including total and differential production rates of the heavy gauge bosons. Topologies 
of single electroweak gauge bosons accompanied with jets are covered in Section 9.3.4. 
This is followed by some data for di- or multi-boson production, including diphotons, 
in Section 9.4. These processes were an important background for the discovery of the 
Higgs boson, and, in general, are also important for BSM signatures involving leptons. 
One of the most dominant particle production processes at the LHC is related to the 
production of top-quarks, either in pairs or as single-top. Its large cross-sections render 
the LHC a prime laboratory for precision studies of this heaviest quark, which in turn 
also means that tops play an important role as background for nearly every important 
signal process at the LHC. Data for this class of processes, including a discussion of 
measurement of the top quark mass and of the QCD radiation pattern is presented in 
Section 9.5. Having discussed all relevant Standard Model candles, finally signals for 
and signatures involving precision studies of the Higgs boson will be presented, Sec- 
tion 9.6. One caveat is that unlike the TEVATRON or HERA, results are still coming 
out of the LHC at a furious rate. So the results shown in this chapter are a snapshot 
taken at the time of writing of this book and do not represent the full on-going LHC 
story. 

Of course, the data presented here constitute only one aspect of the overall LHC 
programme, which also encompasses B physics and the study of highly energetic col- 
lisions of heavy ions. While these indeed are intriguing subjects, they somewhat fall 
outside the remit of this book and the authors apologize for this shortcoming. 


9.1 Total cross-sections, minimum bias and the underlying event 


The most inclusive observables in hadronic collisions are the total and elastic cross- 
sections, Ctot and Gej, the latter possibly given in the form of a distribution with 
respect to the momentum transfer t between the hadrons. However, quite often these 
observables are not trivial to measure, and instead inclusive, predominantly soft, parti- 
cle production studies constitute the material for the first physics publication related 
to collisions. This indeed was the case at the LHC, where the Minimum Bias data 
were the first to appear. In this section, however, the presentation will follow an order 
from the more inclusive to the more exclusive processes, starting with total hadronic 
cross-sections, touching on Minimum Bias data,then discussing the Underlying Event 
in different settings before, finally, some first results for double-parton scattering mea- 
surements will be presented. 
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Table 9.1 Results of measurements of total cross-sections otot, elastic cross-sections cel, and 
the elastic slope B at the LHC. 


Fig. 9.2 The differential elastic cross-section do. /d|t| at the 7 TeV LHC. 


Collab. | Eom. Result 
ATLAS 7 TeV || otot = 95.35 + 0.38 (stat.)+1.25 (exp.)+0.37 (extr.) mb [33 
B = 19.73 + 0.14 (stat.)+0.26 (syst.) GeV~? 
ATLAS 8 TeV || otot = 96.07 + 0.18 (stat.)+0.85 (exp.)+0.31 (extr.) mb [1 
B = 19.74+ 0.05 (stat.)+0.25 (syst.) GeV~? 
TOTEM | 7 TeV || oto = 98.0 + 2.5 mb [167 
Og = 25.141.1 mb 
Oinel = 72.9 + 1.5 mb 
8 TeV || oto = 101.7 + 2.9 mb [166 
Oea = 27.1 + 1.4 mb 
Oinel = 74.7+1.7 mb 
a IE 
$ E | F statistical errors 
Se |) aa r 
L : 
10? |- 
107 |= 
E Taam 
10t- 
107 


Reprinted with permission from Ref. [167]. 
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9.1.1 Total and differential pp cross-sections 


At the LHC, there are a few measurements of the total cross-section, typically based on 
the optical theorem, which are summarized in Table 9.1. The elastic cross-section 
Gel, or the elastic slope B, serve as important inputs to this kind of determination of 
Otot and they are therefore quoted here as well. The crux of this method is that the 
total and elastic cross-sections are related through 


2 167 dog 16r 1 dNa 


ee Z , 9.1 
t 1+ ditl h-o 1+ £ die] fio Oy) 


where t is the momentum transfer between the protons, and the elastic cross-section 
is obtained from the number of elastic events Ne divided by the integrated luminosity 
L. Here p ~ 0.15 captures the effect of the small real parts of the elastic amplitude. 
A similar expression relates the elastic slope B (the slope of the elastic cross section 
at |t| > 0) to the total cross-section. Obviously, in this type of determination of otot, 
one must extrapolate the differential elastic cross-section do.) /d|t| and it is important 
extend the measurement to the smallest possible t. Results from the TOTEM collabora- 
tion [167], including parameters of simple fits to different features, on this observable 
are exhibited in Fig. 9.2, where the peak as |t| — 0 is evident.* 

It is worth noting that in contrast to the TEVATRON with its pp collisions,’ it is 
possible at the LHC to use the van-der Meer method [877], which allows the direct 
determination of the luminosity in collisions of like-sign charged particles. Practically 
eliminating the large systematic uncertainties due to the luminosity in turn translates 
to significantly reduced uncertainties on the cross-section measurement. The results 
of the measurements quoted here put the cross section fits and the models and as- 
sumptions underlying them, cf. Section 7.1.3, to a stringent test; many of the models 
predicted o>; and related quantities to be somewhat larger than observed. 

Moving on to measurements of inelastic cross-sections, it is fairly obvious that 
one could define the inelastic cross-section as the difference of total and elastic cross- 
section, 

Cinel = Otot — Oel. (9.2) 


This is precisely what the TOTEM collaboration uses in their determination of the 
inelastic cross-sections, which for this reason are also quoted in Table 9.1. However, 
quite often a distinction is being made between low-mass diffractive and “truly” 
inelastic events. Defining the scaled diffractive mass of the dissociating proton Ç 
through 


(9.3) 


inelastic events are defined as those events where the larger of the two diffractive 
masses Mx — or, correspondingly Ç — is larger than some critical value. At the LHC 


4The 8 parameter defines the beam envelope. Its value at the collision point is termed 8*. For the 
highest luminosities,it is desirable to have a small value of 8». Measurements of the elastic scattering 
cross-section, however, require special runs with large values of 8x. 


5 As a reminder, the luminosity uncertainty at the TEVATRON was on the order of 6%. 
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Table 9.2 Results of measurements of inelastic cross-sections Gine] in different definitions 
at the LHC at 7 TeV. For the ALICE result, the relatively tight diffractive mass cut was 
extrapolated down to zero, introducing a modeling uncertainty. ¢ is defined in Eq. (9.3). For 


the CMS measurements, the tracks must be in the pseudorapidity region of |ņ| < 2.4, with 
pı > 200 MeV. 


Collab. | Def. Cinel [mb] 
ALICE | Mx > 200 GeV || 73.2 + 2.6 (lumi)??? (model) mb [114] 
ATLAS | ¢>5-10-° 60.3 + 0.05 (stat.)+0.5 (syst.)+2.1 (lumi) mb [5] 
CMs €S5- 1075 60.2 + 0.2 (stat.)£1.1 levee Es 4 (lumi) mb [374] 
>1 track 58.7 + 2.0 (syst.)+2.4 (lumi) m 
>2 tracks 57.2 + 2.0 (syst.)+2.4 (lumi) a 
>3 tracks 55.4 + 2.0 (syst.)+2.4 (lumi) mb 


Mx > 15.7 GeV or Ç > 5-10~° is often used.® Events that do not satisfy this condition 
are then dubbed single- or double-diffractive, and their cross-section is often given 
w.r.t. the inelastic one. The corresponding results at c.m.-energies of Eom. = 7 TeV 
are exhibited in Table 9.2. 

In Fig. 9.3 the results for the total and elastic cross-sections in pp and pp colli- 
sions are compared to data from lower energy measurements and from (higher energy) 
cosmic ray measurements. It is clear from the collider data that the cross-sections 
continue to increase logarithmically with the centre-of-mass energy of the reactions. 
Looking at the data from the highest energies, obtained by astroparticle experiments, 
it also appears that the data continue to rise with energy above the current collider 
reach. The unitarization of the total hadronic cross-section has not set in, and 
the Froissart bound is still ahead of us. 


9.1.2 Minimum Bias physics and inclusive hadron spectra 


By definition, Minimum Bias (MB) data provide a very inclusive picture of particle 
collisions. In the case of pp collisions, the MB data encompass a plethora of physics 
processes, with the bulk of events dominated by the production of relatively few and 
relatively soft particles. This is indeed identical to the case of pp collisions, as studied 
at the TEVATRON, with some results presented in the previous chapter in Section 8.1. 

As in the TEVATRON case, the production mechanisms can be classified as elastic 
hadron scattering, diffractive scattering and inelastic particle production. 
But of course, the boundaries are somewhat blurred, in particular between diffractive 
and inelastic events. Common lore has it that the former are characterized by the 
emergence of rapidity gaps. These are empty regions of rapidity in the particle 
production phase space, with no particles in them. Typically, diffractive events are 
thought to have such rapidity gaps with sizes of the order of a few units, typically 
about 3 or more. Conversely, from a theoretical perspective, such events are thought to 
emerge when pomerons are involved in the process. In fact, in most event generators, 


6Mx is the (diffractive) mass of the system emerging from the proton breakup, and Eec.m. is the 
hadronic centre-of-mass energy. 
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Fig. 9.3 Total and elastic cross-sections for pp and pp scattering vs. cen- 
tre-of-mass energies ranging from a few GeV to 60 TeV,along with the 
results of a global fit. Reprinted with permission from Ref. [1]. 


such a distinction of the different event categories is made, and in the simulation of 
MB events, quite often an admixture of inelastic and single and double diffractive 
event samples must be used. Historically, this has led to correcting the MB data for 
diffractive events, which are effectively subtracted by the use of event generators, 
typically PYTHIA. Quite often the data have been extrapolated to the full phase space, 
i.e. to all pseudorapidities and to zero transverse momentum, again by invoking the 
simulation tools. This can be risky. 

MB measurements at the LHC include only those events which satisfy relatively 
inclusive requirements on visible particles, such as the requirement of a certain, small 
number of (charged) particles with a minimal, usually low, transverse momentum 
inside the acceptance region of the detector. Usually, this boils down to the requirement 
of something like one to six charged particles with a minimal transverse momentum 
between about 100 and 500 MeV inside a pseudo-rapidity regime given by |n| < 2.5 or 
similar. 

In Fig. 9.4 such data, taken by the ATLAS collaboration [2] are shown, based on 
events with at least one charged particle, where all charged particles are inside the 
interval |n| < 2.4 and have transverse momenta py > 500MeV. The particles are 
distributed relatively evenly in pseudo-rapidity, forming a plateau. (If a smaller cut 
on the transverse momentum (100 MeV) is instead used, the distribution has a slight 
peaking around |n|=2.) All MC distributions predict a flat 7 distribution, but there 
are some differences in the normalization. The PYTHIA 6 ATLAS MCO9 tune was fit 
to data from 200 GeV to 1.96 TeV. The PYTHIA 6 AMBT1 tune derived from the 
MCO9 tune, but was also fit to early LHC minimum bias data from 0.8 and 7 TeV. It 
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Fig. 9.4 Single charged particle spectra in Minimum Bias events at the 
LHC at 7 TeV: pseudorapity left), transverse momentum right). All results 
refer to events with at least one charged particle in the interval |ņ| < 2.4 
with a transverse momentum of p1 > 0.5GeV. Reprinted with permission 
from Ref. [2]. 


is not surprising then that the best agreement is with the AMBT1 tune, although the 
MCO09 tune also works well. In any case, the charged particle density for MB events at 
the LHC is larger than that observed at the TEVATRON (Section 8.1). The transverse 
momentum distribution falls seven orders of magnitude between 500 MeV and 10 GeV. 
The high transverse momentum range is hard to fit; here, the AMBT1 and MC09 tunes 
actually perform the worst. 

In Fig. 9.5 (left), the charged particle multiplicity distribution is shown for MB 
events. The various predictions agree reasonably well with the data and each other 
at low Nen, but there are clear deviations at large values of nen, and no prediction 
describes the data well. The mean transverse momentum is plotted versus the track 
multiplicity in Fig. 9.5 (right). The figure indicates that the greater the charged track 
multiplicity, the larger the mean track multiplicity is. This is not surprising in that 
the larger the charged particle multiplicity is in an event, the more likely it is that the 
protons have suffered a violent collision. One possible surprise is the degree of linearity 
of the correlation for charged particle multiplicities above 20. The ATLAS AMBT1 tune 
describes this correlation the best. 

It is clear that such inclusive distributions present a formidable challenge to our 
understanding of the data and of the strong interaction. Typically, the level of agree- 
ment is in the range of a few to about 30%, even for the steepest distributions. That 
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Fig. 9.5 Single charged particle spectra in Minimum Bias events at the 
LHC at 7 TeV:charged particle multiplicity left), mean transverse momen- 
tum versus track multiplicity (right). All results refer to events with at least 
one charged particle in the interval |ņ| < 2.4 with a transverse momentum 
of p1 > 0.5GeV. Reprinted with permission from Ref. [2]. 


the event generators are able to describe the data to this level is somewhat remarkable, 
even acknowledging that the MC parameters were often tuned to this data, in addition 
to lower energy data. This agreement could not necessarily have been expected from 
the beginning, especially keeping in mind that the simulation of MB events is based 
on relatively primitive paradigms, namely multiple parton—parton scattering in 
collinear factorization (see also Sections 7.1 and 7.2). While this provides some con- 
fidence that inclusive features of MB physics are under control well enough to allow 
for more complex measurements, it should not be forgotten that the agreement is less 
than perfect. 

One of the places where such cracks in the otherwise relatively good description of 
MB data shows up, is in the production of individual hadron species, something that 
may be dubbed “hadro-chemistry, and in particular in the production of hadrons 
with multiple strange quarks or of baryons. As an example, consider the case of hyperon 
and cascade production (A and & production), studied by the CMs collaboration [655], 
with some characteristic distributions shown in Fig. 9.6. Not surprisingly, the rapidity 
distributions for both the A and =~ are fairly flat, as in the inclusive case above. 
However, the normalizations of the predictions are off by sizable factors at both 0.9 
and 7 TeV for all predictions. The ratio of =~ /A production is also more or less flat 
in rapidity, and the normalization of the predictions is again off. The ratio of MC to 


Total cross-sections, minimum bias and the underlying event 555 


> T > C J 
me) F 4 oS T | 1 
Po i | ee 0 027 <ER + 
a Q ji , Gi 
Z 015r zZ [ a | 
= | =0.015} 4 
0.1- 2 aie bo 

: SS ot eT Mae adhe 

0.057 Vs=7Tev -=\s=0.9Tev | 0.005 [sss si ae 

L —  PYTHIA6D6T —— PYTHIAG D6T - p= Ea Det -5 7 ORAS Gat | 

H PYTHIA6 PO ~ PYTHIA6 PO J L a 4 

oL u PymHias | PYTHIAS o PYTHIAB. |, 1 PYTHIAB | 

0 0.5 1 1.5 2 0 0.5 1 1.5 2 

Aly = |y 


Fig. 9.6 Strange particle spectra in Minimum Bias events at the LHC at 
7 TeV: The rapidity distribution of A baryons is shown on the left and the 
rapidity distribution of Cascade baryons € is shown on the right, compared 
to several MC predictions. Reprinted with permission from Ref. [655]. 


data, for the transverse momentum distributions of several strange particles, is shown 
in Fig. 9.7, again at both 0.9 and 7 TeV. There is a sizable model dependence to the 
description of these data, and no MC prediction describes the data well. 

When comparing these data with various MC predictions, it is obvious that the 
event generators struggle to achieve an agreement for baryons which is of similar 
quality as in the inclusive case in Fig. 9.4 and Fig. 9.5. This is a testament to the 
fact that the parameters of hadronization have been tuned to e~e* annhilation data, 
typically from LEP 1, involving no incoming hadrons and therefore no highly energetic 
sources of additional colour such as the beam remnants in hadronic collisions. This is 
a clear hint at some deficiencies in our current understanding of some of the aspects of 
particle production at the LHC, while predictions for the bulk of particle production 
are under reasonably good control. 

Another region where the description of data is less than perfect has been studied 
by the ATLAS collaboration in [16], namely the emergence of rapidity gaps. These 
are regions in pseudorapidity where no particle above a certain minimal transverse 
momentum is observed. In this study, rapidity gaps with respect to the edges of the 
detector at 7 = +4.9, and in different bins of the minimal pı, are analysed. The 
largest rapidity gap, Anr, is reported. Diffractive events are responsible for the region 
of large rapidity gaps. Regions of relatively small gap sizes, of up to Anr ~% 3, are 
dominated by fluctuations in standard QCD events, which emerge from the absence 
of parton radiation into that region (and its interplay with the hadronisation model). 
It is clear, therefore, that for p} of up to O (1 GeV), this is a highly model-dependent 
statement, with non-perturbative effects such as colour reconnections or so having a 
strong impact. This in turn depends on the admixture of non—diffractive and diffractive 
events, rendering this type of observable an interesting testbed for various reaction 
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Fig. 9.7 The ratio of MC predictions to data for K?, A and = transverse 
momentum distributions. Reprinted with permission from Ref. [655]. 


mechanisms. 

In Fig. 9.8 (left), the cross-section as a function of An” is shown for particles 
with transverse momenta above 200 MeV (the lowest limit for this measurement), 
along with several MC predictions. The PYTHIA tunes have been fit to ATLAS data. In 
Fig. 9.8 (right), the same data is shown, now compared to separate predictions, from 
PYTHIA 8, of the non-diffractive, single diffractive and double diffractive components. 
Note the exponentially falling non-diffractive component at small gap sizes. At larger 
gap sizes,there is a plateau, corresponding to a combination of single-diffractive and 
double-diffractive processes. All MC models are able to reproduce the general trends 
of the data, but none provide a perfect description. 

It is noteworthy that the observables are defined entirely on the basis of visible, 
and therefore physical final states, allowing for easier (and unbiased) interpretation of 
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Fig. 9.8 Pseudorapidity gap distributions at 7 TeV. The cross-sections are 
measured differentially in terms of Anr, the largest of the pseudo-rapidity 


regions extending to the limits of the ATLAS sensitivity, at 7 = +4.9, in 
which no final state particles are produced above a transverse momentum 
threshold pı cut.The ATLAS data are compared to several MC predictions 
on the left, and to a decomposition into contributions from non-diffrac- 
tive, single-diffractive and double-diffractive on the right. Reprinted with 
permission from Ref. [16]. 


the data in the context of MB physics models. 


9.1.3 The Underlying Event 


The Underlying Event (UE) is to some degree related to Minimum Bias (MB) 
physics, since in most simulations both aspects of relatively soft QCD are described by 
multiple parton—parton scattering. As in the first UE analyses at the TEVATRON, 
it has become customary to divide the azimuthal phase space into a “towards”, an 
“away”, and a two-part “transverse” region such that each of the regions covers an 
identical size of 27/3 or 120°, cf. Fig. 8.5. In some analyses, the two parts of the 
transverse region, are further decomposed into the “trans-min” and the “trans-maz” 
part, depending on their occupation with particles or energy. Usually the regions are 
oriented in such a way that the signal object — the hardest charged track, the jet—axis 
or the lepton pair — is at the centre at 0° of the towards region, which in turn ranges 
from 300° to 60°. The away region then covers the interval [120°, 240°] and the two 
parts of the transverse region are in [60°, 120°] and [2400°, 300°]. 

The first published UE analysis at the LHC, undertaken by the ATLAS collabora- 
tion [6], relied on the leading charged track to orient the regions; its results are depicted 
in Fig. 9.9. The observables that are most sensitive to UE are the density of charged 
particles in the transverse region and the sum of their p; ; in addition, ATLAS reported 
the standard deviations of these observables and the average p_, all as functions of 
the leading track pı and applying a cut of pı > 500 MeV on all particles. 

The qualitative features of the distributions are the same as at the TEVATRON; there 
is a rapid rise with the leading track p,, then a plateau at higher leading track p]. 
The predictions shown in Fig. 9.9 are all derived from MC tunes based on TEVATRON 
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Fig. 9.9 Underlying Event observables with respect to the leading charged 
track transverse momentum in the transverse region at LHC at 7 TeV: the 
charged particle density Neng (left) and the mean of the charged particle 
transverse momentum, (p1) (right). Only charged particles with |7| < 2.4 
and pi > 500 MeV have been considered. Reprinted with permission from 
Ref. [6]. 


data. It is interesting to note that all of the predictions fall short of the ATLAS data 
at 7 TeV. The agreement is better, typically within 10% or less, for predictions with 
subsequent tunes that have incorporated this data, as might be expected. 

A similar analysis based on jets rather than leading tracks has been presented by 
CMS [362] and several of the results are displayed in Fig. 9.10. By the use of jets, 
the range of the measurement is greatly extended from that obtained using only the 
leading track. The mean charge density and the mean summed transverse momentum 
are shown in the figure, compared to several MC predictions. The best agreement is 
with the PYTHIA predictions using the Z1 tune. However, data from CMS at 7 TeV was 
used in defining this tune, so the level of agreement is not surprising. The other tunes, 
developed using lower energy data, also provide good agreement for the mean charge 
density distribution, but not for the mean summed transverse momentum distribution. 

Fig. 9.11 provides a more differential look at the UE data, using the same ob- 
servables as in the previous plot. Good agreement between the data and the MC 
predictions is found, especially for PYTHIA using the Z1 tune. 

The event generators generally work well at the LHC for the description of the UE. 
This finding further strengthens the statement made in the discussion of MB data, that 
the event generators are adept in describing the bulk of LHC events. This indicates 
that the ideas underlying the construction of the non-perturbative models responsible 
for the simulation of MB and UE physics cannot be completely off the mark. However, 
similar to the case of MB, some deficiencies in the UE simulations start to appear 
when going to more taxing regions of phase space. 
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Fig. 9.10 Underlying Event observables with respect to the leading jet 


transverse momentum in the transverse region at LHC at 7 TeV: the charged 
particle density Neng (top left), the sum of the charged particle transverse 
momenta J` p1 (top right), and their distributions (bottom left and right), 


all with respect to the leading jet transverse momentum. Only charged 
particles with |n| < 2 and p1 > 500 MeV have been considered. Reprinted 
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Fig. 9.11 Underlying Event observables with respect to the leading jet 


transverse momentum in the transverse region at the LHC at 7 TeV: the 
charged particle density Neng distribution (left), the sum of the charged 
particle transverse momenta J` pı distribution (right). Only charged par- 
ticles with |n| < 2 and pı > 500MeV have been considered. Reprinted 


with permission from Ref. [362]. 
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s 


Fig. 9.12 Some example diagrams illustrating single (left) vs. double-par- 
ton (right) scattering production of W jj 


9.1.4 Multiple parton scattering 


An interesting feature of the Underlying Event (UE) is that the secondary interactions 
of the hadron constituents that give rise to most of the UE can themselves become 
hard and give rise to physical objects such as additional jets, or even gauge bosons. 

To illustrate the latter consider the production of same-sign W pairs, such as 
W*W? pair production. Clearly, parton—level processes such uu — WtWtdd must 
be invoked already at the lowest order in the Standard Model. The cross-section for 
this process at the LHC is of the order of a few (about five) femtobarns, and, using the 
simplifed model for double-parton scattering (DPS), Eq. (7.43), would yield a similar 
size for the DPS production cross-section of W*WT pairs. This renders same-sign W 
pairs a smoking gun for double-parton scatttering. 

However, the first measurement of DPS at the LHC was achieved with the associated 
production of a W boson with two jets, Wjj, by the ATLAS collaboration in [17]. 
The defining feature of DPS in contrast to the production of the same final state in 
one single partonic interaction, cf. Fig. 9.12 for an illustation of the two production 
modes, is that in the former the W and the di-jet systems are kinematically decoupled, 
and each of them typically has a relatively small total transverse momentum. This 
motivated ATLAS to use the total transverse momentum of the di-jet system or the 
total transverse momentum of the two jets divided by the sum of their individual 
transverse momenta, 

v1 wads 
Ajets = lea + Pal (9.4) 
PEI + ee 
as sensitive observables. The latter, A%,, yields numbers between 0 — when the two 
jets balance each other exactly, with the same transverse momentum oriented back- 
to-back — and 1 — when the jets point into the same direction. In this observable, the 
DPS region is clearly related to relatively low values of this quantity. 

This is shown in the left panel of Fig. 9.13, where ATLAS data are compared with a 

combination of simulation results and a DPS sample constructed from data. The for- 
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Fig. 9.13 (left) The distribution of AFets is shown for Wjj events, along 
with the two templates described in the text. (right) The extracted value of 
Geff from this measurement is plotted along with other measurements of 
this parameter at the LHC and at lower energies. Reprinted with permission 
from Ref. [17]. 


mer, labelled as A+H+J is derived from a single-parton scattering simulation of W+ 
jets, using a combination of leading order matrix elements from ALPGEN [743] supple- 
mented with the parton shower of HERWIG [415] and its underlying event, obtained 
from JIMMY [298]. The latter is actually constrained to produce parton-parton scatters 
only with a transverse momentum below p, < 15 GeV, well below the jet cut in the 
data of pı > 20 GeV. The latter, labelled as template B, is taken from data, where 
inclusive W production events have been overlayed with di-jet events, both taken in 
the early stages of Run I at the LHC. Extracting the overall normalisation of template 
B from a fit corresponds to fixing the value of ceg in the simple model of Eq. (7.43). In 
the right panel of the same figure, Fig. 9.13 the result of this fit and one obtained from 
a similar analysis by the CMS collaboration [387] are compared with determinations of 
the cross-section from lower energy data. The values of cep are all consistent, except 
the result from the AFS collaboration. 

Studies of DPS scattering with four-jet topologies [379] or involving the prompt 
production of J/w pairs [656] will further contribute to a more detailled understanding 
of the underlying mechansim and will ultimately help to improve on the simple model 
with a single parameter oe. 


9.2 Jets 


9.2.1 Inclusive jet production 


At the LHC, in contrast to the TEVATRON, IR-safe jet clustering algorithms, in partic- 
ular the anti-kp algorithm, are universally used. As mentioned previously, the anti-kr 
jet algorithm was developed only near the end of the activity of the TEVATRON. In 
addition to its ease of use with fixed-order predictions, the anti-kp algorithm provides 
jets that are very close to perfect cone-shaped, allowing an easy determination of the 
effective jet area. ATLAS typically uses jet sizes of 0.4 and 0.6 [31], while CMS uses 
jet sizes of 0.5 and 0.7 [383, 405]. Both experiments will expand the range of jet sizes 
used, in particular to be able to compare directly to the other experiment’s results, 
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for example with a common jet size of 0.4. Here, we discuss CMS results for inclusive 
jet cross-section measurements with the anti-ky (D = 0.5) and anti-kr (D = 0.7) jet 
algorithms, at a centre-of-mass energy of 8 TeV. Similar results exist for ATLAS with 
the anti-kp (D = 0.4) and anti-kr (D = 0.6) algorithms. 

The calorimeter coverage for both ATLAS and CMS goes out to rapidities of the 
order of 4.5. However, many of the analyses involving jets restrict themselves to jets 
in more central rapidity regions (typically yjet < 2.5 — 3), where more tracking in- 
formation is available. The tracking information serves both to improve the energy 
resolution of the measured jet, and as a way of discriminating between jets from the 
event of interest and jets produced by pileup. Fewer tools are available at higher ra- 
pidities, but this region is still important for many physics results. Jet measurements 
at high y provide useful information in PDF fits and for discriminating between the 
VBF and gluon-gluon fusion mechanisms for Higgs boson production, for example. A 
number of analyses at both ATLAS and CMS have measured jets out to the full rapidity 
coverage, as will be shown in this chapter. 

In CMS, jet measurements are conducted primarily with the particle-flow event re- 
construction algorithm [400], in which tracking and calorimetry information is used in 
a framework optimized to provide the best jet energy resolution. An offset correction is 
used to remove the energy contributed by additional proton-proton interactions [303]. 
The offset method calculates a rapidity-dependent energy density p, which when mul- 
tiplied by the jet area, provides an indication of the energy to be subtracted from the 
jet. Most of the pileup is due to collisions in the same bunch-crossing, with smaller 
contributions from out-of-time pileup. Note, though, that unlike at the TEVATRON, the 
underlying event energy is subtracted by the offset correction along with the pileup 
energy. The non-perturbative corrections for the underlying event can effectively be 
added back to the data for easier comparison to Monte Carlo predictions. This offset 
method is becoming the standard for both experiments. 

Results for the measurement of the inclusive jet cross-section for the anti-kr (D = 
0.7) jet algorithm for all rapidity intervals are shown in Fig. 9.14 compared to NLO 
QCD predictions using CT10 PDFs, and modified by non-perturbative corrections [405]. 
A linear comparison of the data to NLO jet cross-section predictions with various 
PDFs, for the central rapidity region, is shown in Fig. 9.15. The jet measurement 
reaches transverse momenta greater than 2 TeV, with the integrated data sample of 
19.7 fb~+. Better agreement is observed for the anti-kr (D = 0.7) results than for the 
results using the anti-kr (D = 0.5) algorithm (not shown), perhaps indicating that 
while the NLO prediction (where at most two partons can be in a jet) describes the 
jet shape reasonably well, it does not describe the jet shape completely. See also the 
discussion below. 

Over most of the kinematic range, the experimental uncertainties are smaller than 
the theoretical uncertainties (both PDF and scale). The scale uncertainties have greatly 
improved upon completion of the NNLO jet calculation. Given the spread of PDF 
predictions, this data should be useful in parton distribution function fits. 

Note that NLO electroweak corrections are already on the order of approximately 
5% at 1 TeV and increase fairly rapidly with jet transverse momentum, cf. the simple 
estimate provided in Eq. (3.232) and Fig. 3.12. This estimate is confirmed by an exact 
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Fig. 9.14 A comparison of the CMS inclusive jet cross-section measured 
using the anti-kr (D = 0.7) jet algorithm to NLO predictions using the 
CT10 PDFs. Reprinted with permission from Ref. [405]. 


calculation of Ref. [469]. 

Many of the systematic uncertainties, both experimental and theoretical, cancel in 
the ratio of measurements of jet cross-sections for two different jet sizes. The ratio for 
anti-kr (D = 0.5) to anti-kp (D = 0.7), using CMS data from 7 TeV [383], is shown in 
Fig. 9.16 The ratio of the two cross-sections starts at approximately 0.7 and rises as 
the jet transverse momentum increases, as expected given the increasing collimation of 
the jet. Fixed-order predictions, either at LO or NLO, do not describe the shape of the 
measured ratio. Better agreement is provided by the incorporation of non-perturbative 
corrections to the fixed-order predictions, but the best agreement is achieved by the 
POWHEG +PYTHIA 6 prediction, which combines a NLO matrix element calculation 
within a parton shower Monte Carlo framework. It will be interesting to compare the 
calculated NNLO (with non-perturbative corrections) ratio to this data, to see to what 
extent the higher order prediction can describe the jet shape. Similar results have been 
reported by ATLAS [81]. 

At the time of writing of this book,the 7 TeV data has been incorporated into 
global PDF fits [194, 489, 614], using the larger jet size where the NLO predictions 
may be more applicable. 


9.2.2 Dijet production 


Measurements of the inclusive dijet cross-section provide a means to test precision 
predictions of perturbative QCD at the highest mass scales achievable at the LHC. Such 
comparisons can help to constrain the high-x gluon distribution, as well as to search 
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Fig. 9.15 The ratio of the CMS inclusive jet cross-section measured using 
the anti-kr (D = 0.7) jet algorithm to NLO predictions using the CT10 
PDFs, as well as the ratios of NLO predictions using other PDFs to pre- 
dictions using CT10. Reprinted with permission from Ref. [405]. 


for the presence of new contact interactions, for example due to quark compositeness. 
At the LHC, the dijet cross-section has been measured out to dijet masses of 5 TeV. 
Typically, the measurement is divided into bins of y*, with y* defined as half of the 
absolute value of the rapidity difference between the two jets. Using the kinematic re- 
lations of Section 4.1 it is straightforward to show that, at leading order, this quantity 
is related to the centre-of-mass scattering angle 6* by | cos 6*| = tanh y*. Thus high 
values of y* correspond to larger values of cos 0*. A measurement of the dijet mass 
cross section, in bins of y*, carried out by ATLAS [27] is shown in Fig. 9.17, compared 
to NLO parton level predictions from NLOJET++. The measurement in the plots has 
been carried out with the anti-kp (D = 0.6) jet clustering algorithm (similar mea- 
surements are available with the anti-ky (D = 0.4) algorithm). Jets are reconstructed 
using topological cell clusters [39]. These clusters are determined from calorimeter 
cells and local hadronic calibration weighting. The latter depends on the good 3-D 
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Fig. 9.16 The ratio of the CMS inclusive jet cross-section measured us- 
ing the anti-kr (D = 0.5) jet algorithm to the jet cross-section using 
D = 0.7. Comparisons are made to LO and NLO theoretical predictions, 


with and without non-perturbative corrections. Reprinted with permission 
from Ref. [383]. 


(lateral and longitudinal) segmentation of the ATLAS calorimeters. Pileup corrections 
are determined from Monte Carlo calculations, as a function of the number of track 
vertices in the event (for in-time pileup) and the average instantaneous luminosity at 
the time of the event (for out-of-time pileup), in bins of jet pr and rapidity. Note the 
differences with respect to the technique described in the previous section for the CMS 
inclusive jet analysis. ATLAS is switching to an offset method similar to that described 
for CMS. 

The NLOJET++ predictions have been corrected for non-perturbative effects. In the 
calculations, a common renormalization and factorization scale of p7'“* exp (0.3y*) has 
been used, where p7'* refers to the transverse momentum of the largest jet. For values 
of y* near zero, and for 2 — 2 scattering, the scale reverts to just the jet transverse 
momentum. As noted in Section 4.1, the peak inclusive jet cross-section at NLO tends 
to move to higher scales as the jet rapidity increases. The same is true for dijet cross- 
sections as a function of y*, and the scale choice given above tends to be near the peak 
value of the dijet cross section at high y*. In fact, at very high dijet mass, and at large 
values of y*, a scale choice close to p°” can actually lead to negative cross-sections 
at NLO. 

Electroweak corrections have also been taken into account in the data comparisons. 
These corrections are typically less than 1% for y* > 0.5, but can be larger than 9% for 
high dijet mass (> 3 TeV) for y* < 0.5, as shown in Fig. 9.18 . The corrections include 
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Fig. 9.17 The dijet mass cross-section from ATLAS as a function of y* 
compared to predictions from NLOJET++ using several PDFs. Reprinted 
with permission from Ref. [27]. 


both tree-level effects of order (aas, a?) and weak loop effects of order (aa?) [469]. 

The probability for the NLO predictions to describe the data, taking into account 
the experimental uncertainties, is indicated in the plots for each y* bin. Both the CT10 
and HERAPDF1.5 PDFs describe the data well, except perhaps in the y* interval from 
1.0 to 1.5. Contact interactions preferentially produce events at low y* compared to 
QCD. Thus, the most sensitive region to search for the effects of contact interactions 
is at low y* (< 0.5) and high dijet mass (> 1.31 TeV). Using a model of QCD+contact 
interactions with left-left coupling and destructive interference, the ATLAS data is 
sufficient to exclude contact interactions at a scale A less than 7.1 TeV (using the 
CT10 PDFs). Similar results are also available from CMS [8376]. 

Dijet production is not expected to be well-described by fixed-order predictions 
in kinematic configurations where either a large rapidity interval exists between the 
two jets, and/or there is a veto on the existence of a third jet in the rapidity interval 
bounded by the dijet system. In these situations, higher order corrections can become 
important, and logarithmic terms depending on the rapidity separation between the 
two leading jets’ or on the average transverse momentum of the dijets may need to 
be resummed in order to achieve a good description of the data. 

The region where two jets, with a fixed pr threshold, are separated by a large 
rapidity interval corresponds to a large value of § and a small value of f. Such regions 


7Technically, the logarithmic terms depend on the dijet mass, but in situations where the jets 
are separated by a large rapidity interval and the transverse momenta of the jets are similar, the 
argument of the logarithm reduces to the rapidity separation. 
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Fig. 9.18 Electroweak corrections for the dijet mass cross-section from 
ATLAS for several different y* bins. Reprinted with permission from 
Ref. [27]. 


are dominated by t-channel gluon exchange and one expects a linear growth of jet 
multiplicity with increasing Ay;;. This can be observed in Fig. 9.19, which shows the 
mean number of jets (defined by pr > 20 GeV) in the rapidity interval bounded by the 
dijet system, for different average transverse momenta for the tagging jets. The ATLAS 
data is compared to predictions from HEJ and POWHEG [3]. Here, the Ay;; interval 
is defined by the two jets in the event that have the greatest rapidity separation. If 
instead, the two highest pr jets were used, the growth with Ay;; would be reduced 
by about a factor of two [3]. Thus, rapidity ordering may be more efficient in rejecting 
gluon-gluon fusion production of a Higgs boson, in order to measure VBF Higgs boson 
production, than using the two highest pr jets, as is currently done. The reason for the 
faster increase with rapidity ordering can be easily understood. If the bounding jets 
are the two highest pr jets, then any additional jets produced in the gap are required 
to be in the transverse momentum range determined by the jet cutoff and the second 
highest pr jet in the event. For rapidity ordering, there is no such bound, and there 
is effectively a larger phase space. There are practical considerations, however, in the 
experimental difficulties with dealing with jets at very forward rapidities, especially in 
high pileup conditions. 
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Fig. 9.20 The ATLAS inclusive jet multiplicity distribution at 7 TeV com- 
pared to LO and NLO predictions from NJET. The jets are measured using 
the anti-kr (D = 0.4) jet algorithm. The plot is taken from Ref. [183] using 
data from Ref. [4]. Reprinted with permission from Ref. [183]. 


In the figure, comparisons are made to POWHEG, coupled either with the PYTHIA 
or HERWIG parton shower and to HEJ, at the partonic level. The POWHEG predic- 
tions include a full NLO partonic description of the dijet system and the PYTHIA and 
HERWIG parton showers provide a resummation of soft and collinear gluon radiation. 
The HEJ formalism provides a leading logarithmic resummation of terms proportional 
to the rapidity separation of the two jets, embedded in a framework that includes 
fixed-order corrections from multi-jet matrix elements. 


9.2.3 Multijet production 


Multijet production at the LHC is an interesting and important process, in that it 
allows for precision tests of the perturbative QCD framework, serves as a platform 
for measuring the running of a,, and also forms a background for many types of 
new physics. Final states with 7 or more jets have been measured at the LHC [4, 
31, 404, 663], and NLO predictions are available for states with up to 5 jets [183]. 
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ATLAS, for example, has measured final states with up to 6 jets at 7 TeV, with the 
requirement that the leading jet have a transverse momentum greater than 80 GeV 
and additional jets have transverse momentum greater than 60 GeV. The anti-kr jet 
algorithm with D = 0.4 and D = 0.6 was used. A comparison of the ATLAS data for the 
jet multiplicity distribution is shown in Fig. 9.20, along with LO and NLO predictions 
from NJET [183]. The NLO calculation significantly decreases the scale uncertainty 
from that obtained at LO, and in general is in better agreement with the data, with 
the exception of the 2-jet bin, where there are large negative NLO corrections. For 
the higher jet multiplicities (for the bins where NLO predictions are available), the 
ratio between theory and data is in the range of 1.2-1.3. As Ref. [183] notes, the main 
driver for the difference between the LO and NLO predictions is the use of LO PDFs 
for the former. If NLO PDFs are used for both predictions, the results at the two 
orders are very close. In particular, if a scale of Hr /2 is used, the ratio of the NLO to 
LO predictions tends to be very flat as a function of the relevant kinematic variables. 
This same behaviour has been observed for W/Z+jets at the TEVATRON, as discussed 
earlier, and will be encountered again in the context of W/Z+jets at the LHC. 

In Figure 9.21 (left), the ratio of the cross-section for the production of (n + 1) 
and n jets for the ATLAS data is compared to NLO predictions using several PDFs. 
Within uncertainties, the predictions agree with the ATLAS data for 04/03 and 05/04. 
In this case, the LO and NLO predictions for the ratios are within 10% of each other 
due to cancellations of PDF effects. In Figure 9.21 (right) is shown the 03/02 ratio 
(for D = 0.6) as a function of the lead jet transverse momentum. Good agreement is 
obtained for predictions from all PDFs at large leading jet transverse momentum. 


9.2.4 Jet substructure 


Knowledge of the jet four-vector allows for calculation not only of the transverse 
momentum and rapidity of the jet, but also of an additional degree of freedom, the 
jet mass. Jets acquire mass dynamically through perturbative gluon radiation, and 
to some extent through the non-perturbative fragmentation process. As with the jet 
shape, the one hard gluon present in a NLO jet calculation describes the perturbative 
contribution to the jet mass reasonably (but not perfectly) well, i.e. a parton shower 
is not required for tolerable agreement. 

The general form for a typical jet mass can be determined through dimensional 
analysis: the dominant contribution to the jet mass squared (at NLO parton level) 
should scale with the square of the transverse momentum of the jet, with the square 
of the size of the jet R, and be proportional to one factor of as, where the argument 
of a, should be related to the transverse momentum of the jet: m? x as(pr) pR. 
Thus, there is a roughly linear dependence of the jet mass on both the pr and the size 
of the jet. Of course, in any particular event, the gluon radiation process is stochastic, 
and thus there will be a great variation of jet masses in practice, as will be shown 
later. In general, the jet mass distribution will be strongly suppressed for low jet 
masses (this corresponds to little or no gluon radiation), will rise roughly linearly to 
a peak value, and then fall off slowly with jet mass, with a slope between 1/m and 
1/m?. The peak of the jet mass distribution occurs for m ~ (0.1 — 0.2)prR, and is 
dominated by multiple soft wide-angle gluon emissions. There is also a shoulder at 
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Fig. 9.21 (left) The ratio of the cross-sections for (n + 1) and n-jet pro- 
duction, for n = 2, 3 and 4, in ATLAS data [4] and from the theoretical 
predictions of NJET +SHERPA. The jet clustering uses the anti-kr (D = 0.4) 
algorithm. (right) The 3-jet to 2-jet ratio as a function of the leading jet 
transverse momentum using anti-kr (D = 0.6) jet clustering. Predictions 
are shown at LO and NLO for several PDFs. Reprinted with permission 
from Ref. [183]. 


larger masses (0.3 < m/(prR) < 0.5) dominated by a single hard gluon emission. 
This shape means that the average jet mass is above the peak value. A simple rule- 
of-thumb, that describes the result of an exact NLO calculation of the jet mass to 
reasonable accuracy, is that the average mass of a jet at the LHC (13 TeV) measured 
with the anti-kr jet algorithm is approximately given by (m) = 0.16 prR [508]. Since 
the factor of a, should be accompanied by the colour charge of the hardest parton 
in the jet, gluon jets should have an average mass a factor of ,/C4/Cr greater than 
quark jets. The general formula is for an average of gluon and quark jets. The jet mass 
distribution does not depend strongly on the centre-of-mass energy, but will depend 
on the jet algorithm (as well as the jet size). At the particle level, jet masses will be 
larger than at the parton level, due to non-perturbative contributions but with the 
difference growing smaller with increasing jet pr. 

The jet mass is an interesting variable to measure, not just because of the pertur- 
bative QCD aspects, but because a jet may be massive because it’s a (boosted) W 
boson, or a top quark, or even a Higgs boson [106]. For example, it was shown that the 
discrimination of the signal for a Higgs boson decaying into a bb pair can be improved 
in the kinematic region where the Higgs is at high transverse momentum, and the bb 
final state is reconstructed within a single (fat) jet [297]. 

With a few exceptions [64], there was little investigation into jet masses at the 
TEVATRON, but this information has become an integral part of many analyses at 
the LHC [8], especially in the context of jet grooming. In general, all jet grooming 
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techniques are designed to provide a separation between the decays of heavy objects 
and the QCD branchings that are a normal aspect of parton evolution inside jets. In 
addition, the grooming techniques try to remove soft energy depositions inside the jets, 
which can arise both from the underlying event in a hard collision, as well as from the 
multiple minimum bias interactions that are present in high luminosity LHC running 
conditions. Three of the major grooming tools are filtering [297], trimming [699], and 
pruning [512, 513], which are described below in the context of a CMS analysis. 

CMS has measured jet mass distributions in both V+jets and inclusive dijet events, 
and has examined the impact of filtering, trimming, and pruning on the jet mass distri- 
butions [377]. In filtering, a jet determined through a regular jet clustering algorithm, 
most typically the anti-kr algorithm for the LHC, is re-clustered using the Cambridge- 
Aachen algorithm with a smaller jet size (R = 0.3 for the CMS analysis). The resulting 
new sub-jets are ordered in transverse momentum and the jet is redefined using only 
the three hardest sub-jets. In the trimming algorithm, jets are again re-clustered using 
a smaller jet size (using the kr-clustering algorithm), and sub-jets are only kept if 
they pass the requirement that PTsub > feutAnara: Typically, Anara is chosen to be 
equal to the transverse momentum of the original jet. For this CMS analysis, Rsub has 
been chosen to be 0.2 and feut to be 0.03. The pruning algorithm re-clusters the con- 
stituents of a jet with the Cambridge-Aachen algorithm, using the original parameters, 
but requiring that for two sub-constituents 7 and j, the softer of the two constituents 
is removed when the following conditions are satisfied: 


min(p 1 ;, P1;) 


Ge Oe 9.5 

í Pri TPL; l ve) 

MRS Dm =o, (9.6) 
PL 


where my and pr are the mass and transverse momentum of the (original) jet, and 
the parameters Zcut and œ have been chosen to be 0.1 and 0.5. 

Fig. 9.22 shows distributions of the ratios of jet mass distributions after grooming 
(using either the filtering, trimming, or pruning techniques), for reconstructed data, 
for reconstructed simulated PYTHIA 6 events, and for generator-level PYTHIA 6 events. 
The events are from inclusive dijet production and the jets have been reconstructed 
with the anti-kr (D = 0.7) jet algorithm. All distributions have been corrected back to 
the particle level, to allow for direct comparison to theoretical predictions. In general, 
filtering results in the smallest changes to the original mass distributions, followed by 
trimming and then pruning (with the parameters chosen in the CMS analysis). 

Fig. 9.23 and Fig. 9.24 show the unfolded jet mass distributions for anti-kr (D = 
0.7) jets from Z — ll+jet events for ungroomed jets and pruned jets respectively, 
with the parameters for the grooming as described above.® The data are shown for 
four different jet transverse momentum bins and are compared to predictions from 
PYTHIA 6 and HERWIG ++, with the tunes indicated in the legends. The Sudakov 
suppression at low jet mass, peaking and then slow fall-off with increasing jet mass 
described earlier, can be seen for the ungroomed jet distributions in Fig. 9.23. The 


5Results for filtered and trimmed jets can be found in Ref. [377]. 
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Fig. 9.22 The differential probability distributions for jet mass ratios 
for groomed jets to ungroomed jets for three different grooming tech- 
niques. The data are CMS dijet events,and the Monte Carlo predictions 
use PYTHIA 6. The Monte Carlo predictions are given both at the gener- 
ated and reconstructed levels. Reprinted with permission from Ref. [377]. 


dip for low jet mass fills in with the use of an aggressive grooming procedure such as 
pruning. The peak region of the jet mass distribution receives substantial contributions 
from soft gluon emissions at wide angle. After pruning, these emissions are largely 
removed, resulting in events in the peak region migrating to lower masses [506]. 

In general, the data are in good agreement with the Monte Carlo predictions, except 
perhaps for smaller jet masses. Ref. [377] notes that an aggressive grooming procedure 
(like pruning) tends to lead to better agreement between data and the Monte Carlo 
simulation, and that the data/Monte Carlo agreement is better in general for the 
V-+jets analysis than for the dijet analysis, perhaps indicating that quark jets (more 
typical in V+jet final states) are better modelled in Monte Carlo than gluon jets. 

Formerly, it was thought difficult to provide analytic calculations that describe the 
impact of the jet grooming techniques on distributions, such as jet masses, but great 
progress has been made in recent years [431]. These calculations not only provide a 
better understanding of the different jet grooming techniques, but have also allowed 
for the development of new tools, such as the mass-drop tagger [431] and the soft drop 
tagger [722]. 
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Fig. 9.24 
jet mass distributions for Z — ll + jet events. Reprinted with permission 
from Ref. [377] 
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The same comparison as in Fig. 9.23, for the unfolded pruned- 
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9.3 Drell-Yan type production 


9.3.1 Inclusive spectra 


One of the primary benchmarks, and often one of the first cross-sections to be mea- 
sured, is that for W and Z production. ATLAS and CMS have both measured the W 
and Z cross-section at 7 TeV (during the 2010 running) [12, 361] and in addition 
CMS has measured the cross-sections at 8 TeV (in special low luminosity, and thus 
low pile-up, running conditions) [380]. The cross-sections are measured in both the 
electron and muon channels, with similar requirements for the transverse momenta 
and rapidities for the electron and the muon. The data are corrected for the effects of 
final-state QED radiation and an isolation cut is placed on the lepton candidates. The 
differential cross-sections are then combined after extrapolating each measurement to 
a common fiducial kinematic region. For ATLAS, the missing transverse energy for W 
boson production is required to be greater than 25 GeV and the transverse mass is 
required to be greater than 40 GeV. For CMS, no explicit cut is placed on the missing 
transverse energy, but the missing transverse energy distribution is used to determine 
the background. In CMS, the Z boson candidates are required to have a mass between 
60 and 120 GeV, while for ATLAS the range is 66 to 116 GeV. 

The cross-sections are measured in the fiducial regions as well as being extrap- 
olated to the full phase space. The latter involves a calculation for the geometrical 
and kinematic acceptances for the measurement, and thus the introduction of possible 
model dependence. Typically, POWHEG +PYTHIA is used for the extrapolation. The 
theoretical systematic uncertainties for the acceptance calculation can be evaluated 
by varying the PDFs (using the PDF4LHC prescription for the three global PDF 
families), examining the impact of NNLL soft gluon resummation using the program 
RESBOS, and examining the impact of higher-order corrections by varying the renor- 
malization and factorization scales in the program FEWZ within a factor of two. The 
effects of higher-order EW corrections can also be simulated by the use of the pro- 
gram HORACE. The effect of extrapolating from the fiducial to the full phase space is 
typically to increase the cross section by a factor of about 2(2.5) for W(Z) produc- 
tion. The total uncertainties for the extrapolation corrections for W and Z production 
range between approximately 1 and 1.5%, with each of the sources mentioned above 
contributing. 

The ratio of the W to Z cross-section can be especially interesting, as many of 
the systematic errors, from both experiment and theory, cancel out. The total W 
and Z cross-sections measured by CMS at 8 TeV are shown in Fig. 9.25, along with 
the NNLO predictions from the three global PDF sets. The (solid) ellipse indicates 
the 68% CL region for the total experimental uncertainty. The (open) ellipses for the 
theory predictions indicate the 68% CL PDF uncertainties from each group. The three 
predictions are all consistent with the CMS data, so there is no great discrimination 
among them provided by the total cross section measurements. All of the PDFs provide 
a somewhat lower prediction for the Z boson cross-section than observed in the data, 
though. The ATLAS cross-sections are consistent with those from CMS. The ratios of 
various W and Z cross-sections from CMS at 8 TeV with NNLO predictions are plotted 
in Fig. 9.26. 
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Fig. 9.25 The CMS total W and Z cross-sections at 8 TeV (extrapolated 
from the fiducial cross-sections) compared to NNLO predictions from the 
CT10, MSTW2008, and NNPDF2.3 PDFs. Reprinted with permission from 


Ref. [380]. 
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Fig. 9.26 The ratios of CMs W and Z cross-sections (extrapolated from 
the fiducial cross-sections) to NNLO predictions. Reprinted with permis- 


sion from Ref. [380]. 
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9.3.2 Differential cross-sections in mass and rapidity 


More information can be obtained by measuring double differential Drell-Yan cross- 
sections. CMS, for example, has measured the cross-section for electron and muon pairs 
as a function of the dilepton mass and the cross section for muon pairs as a function of 
mass and rapidity, at centre-of-mass energies of 7 TeV [373] and 8 TeV [667]. The data 
cover a wide kinematic range, from 20 GeV to 2000 GeV. The data are corrected for the 
effects of final-state QED radiation and are extrapolated from the fiducial measurement 
to the full phase space. The consistency of the electron and muon channels allows them 
to be combined. All of the differential cross-sections have been normalized to the cross- 
section for the Z-peak region (60 < m < 120 GeV). 

The combined (electron and muon) Drell-Yan differential cross-section at 7 TeV as a 
function of the dilepton mass is shown in Fig. 9.27, compared to theoretical predictions 
at NNLO QCD using FEWZz 3.1 [761] and the CT10 PDFs, and with electroweak 
corrections at LO and at NLO. The contributions of lepton pair production from yy 
initial states have been taken into account. These contributions increase with dilepton 
mass, reaching up to 10% for the highest mass bin. The higher mass reach at Run II 
will result in even larger contributions from yy initial states, necessitating a better 
determination of the photon PDF in the proton. (See, however, the discussion in 
Section 6.2.2 on the photon PDF and recent advances in its determination.) The data 
are in good agreement with the theoretical predictions using CT10. The measurement 
does not provide, however, sufficient sensitivity to distinguish CT10 from other PDFs. 

The Drell-Yan cross-section at 8 TeV as a function of rapidity is shown for the 
Z boson mass region (60-120 GeV) and the highest Drell-Yan mass bin (200-1500 
GeV) in Fig. 9.28 (from a total of 6 mass bins in the CMS measurement) [667]. NNLO 
predictions from FEWZ 3.1 are shown for two NNLO PDFs. Good agreement with 
the data is observed for both PDFs. Here, in the double-differential distributions, 
the sensitivity is sufficient to distinguish among the various PDF families, and this 
information will be useful in PDF fits utilizing LHC data. 

The ATLAS and CMS measurements of W and Z production are predominantly in 
the central rapidity region, |y| < 2.5. The LHCB experiment, having been designed 
primarily for the study of forward production of particles containing b- and c-quarks, 
extends the kinematic reach for LHC vector boson measurements up to a rapidity of 
4.5. This is very useful, for example, for better constraints on PDFs at both low and 
high xz. LHCB has measured W production at a centre-of-mass energy of 7 TeV in the 
muon channel [408]. The muons in this measurement are required to have a transverse 
momentum greater than 20 GeV and a rapidity between 2 and 4.5. Given the non- 
hermiticity of the detector, there is no requirement on missing transverse energy. To 
reduce backgrounds from heavy flavour decays, the muons are required to be isolated, 
with the sum of the transverse momenta of all charged tracks within a radius of 0.5 of 
the muon direction (as well as a related quantity involving calorimeter information) 
required to be less than 2 GeV. A final state QED radiation correction is applied to 
account for the events lost due to the photon radiation resulting in the muon failing 
the kinematic cuts. 

The resultant cross-sections for Wt and W~ production, and the ratio between 
the two, are shown in Fig. 9.29 and compared to NNLO predictions using six different 
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Fig. 9.27 The differential cross-section for lepton pair production as a 
function of the dilepton mass. The cross-section has been normalized to 
the Z-peak region and the electron and muon channels have been added. 
Comparisons are made to the NNLO predictions from FEWZ using the 
CT10 NNLO PDFs, with EW corrections at LO and NLO added. The 
shaded bands in the lower two plots represent the statistical uncertainty 
from the FEWZ calculation added in quadrature with the 68% CL PDF 
errors from CT10. Reprinted with permission from Ref. [373]. 


PDFs. It is interesting to note that the W~ cross-section is larger than the WT cross- 
section at high muon pseudorapidity, perhaps counter to expectations. The W+ (W7) 
boson at high rapidity is produced primarily from the collision of an up quark (down 
quark) with large momentum fraction z, and a d (ū) anti-quark with low momentum 
fraction x. As the high-z up quark distribution is larger than that of the down quark 
(and the low-a anti-quark distributions are essentially equal), one would expect that 
the WT cross-section would be higher than that of the W- boson. This would be true 
if the cross-sections were plotted against the rapidity of the W boson, cf. Fig. 2.16. 
However, the muon in the decay of the W~ tends to travel in the direction of the boson 
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Fig. 9.28 The Drell-Yan differential cross-sections with respect to rapid- 
ity, in 2 different mass bins. Comparisons are made to predictions obtained 
from FEWZ 3.1 using two NNLO PDFs. Reprinted with permission from 
Ref. [667] 


in the boson centre-of-mass frame (and the muon in the decay of the W* opposite the 
direction). Thus, the u~ leptons tend to have higher transverse momenta than the u” 
leptons, and binning versus the muon pseudorapidity results in a greater proportion 
of u` at high pseudorapidity (cf. the discussion at the end of Section 2.2.3). 


9.3.3 Differential cross-sections in transverse momentum 


ATLAS [34, 52], CMS [366, 666] and LHCB [57] have extensively studied the transverse 
momentum distribution of lepton pairs at centre-of-mass energies of both 7 and 8 TeV. 
The large integrated luminosities allow for differential measurements of the transverse 
momentum to be carried out both as a function of the lepton pair rapidity and of 
its mass, allowing for tests of the resummation and parton showering formalisms in 
different kinematic regions. 

In this section, the focus will be on the ATLAS results at 8 TeV. Cross-sections 
were measured in both the di-muon and di-electron channels. Relatively broad mass 
ranges were chosen to minimize the effect of QED FSR on the signal acceptance. The 
transverse momentum distribution in the Z boson mass range for the di-electron and 
di-muon channels (and the combined result) is shown in Fig. 9.30 (left). The leptons 
were required to have opposite sign, transverse momenta greater than 20 GeV, and to 
be within a rapidity (absolute value) of less than 2.4 (2.47 for the electron channel). In 
addition, an isolation cut was applied in the muon channel to reduce the backgrounds 
from heavy flavour decays. The isolation requirement induces a sizable pr dependence 
in the muon selection efficiency and must be accounted for in each bin. The cross- 
sections in the two channels were then corrected back to the Born level and combined, 
with the relevant correction factors between Born, bare and dressed levels specified for 
the two channels. This is a process which can be measured very accurately. The total 
uncertainty for the normalized cross-section is less than 1% for transverse momenta up 
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Fig. 9.29 The differential cross-sections for W* and W~ production as 
a function of the muon pseudorapidity. Comparisons are made to NNLO 
predictions using six PDFs. Reprinted with permission from Ref. [408]. 


to about 200 GeV. The range at 8 TeV extends from a transverse momentum at 1 GeV 
to approximately 900 GeV. There is an appreciable broadening of the pr distribution 
in going from 1.96 TeV to 8 TeV. This is expected as there is more phase space for 
gluon radiation at the higher energy, due to the lower average x values for the colliding 
partons. 

The data for all three experiments have been compared to a number of theoretical 
predictions, varying from NNLO fixed-order, to NNLO, but also including the effects 
of soft gluon resummation at NNLL, to parton shower Monte Carlos, in some cases 
including fixed-order information at NLO and with various tunes. A comparison of 
the normalized cross-section to the resummed predictions from the RESBOS program 
(NNLO+NNL) is shown in Fig. 9.30 (right). Good agreement with the data is observed 
below 20 GeV. There is a dip around 40 GeV, and then a rise in the theory prediction 
with respect to the data at high transverse momentum. The dip is in the transition 
region where there is a matching between the resummed part of the RESBOS prediction 
and the fixed-order part. Improvements in this matching should reduce this dip. The 
rise at high pr is just an artefact of the scale choice which does not take into account 
the transverse momentum of the Z boson (and thus results in too small of a scale). 

The absolute (un-normalized) high pr (> 20GeV) Z boson cross-section is shown 
in Fig. 9.31 (left), compared to the NNLO predictions of NNLOJET [565]. The NNLO 
corrections are relatively small, but result in a significant reduction of the scale un- 
certainty. The data are above the absolute prediction for most of the Z boson pr 
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Fig. 9.30 The ATLAS Z boson transverse momentum distribution at 
8 TeV is shown for the two Z boson decay channels and their combination 
(left). On the right, the data are compared to the prediction from RESBOS. 
Reprinted with permission from Ref. [52]. 


range. (There is also a 2.8% luminosity uncertainty which is not shown.) If instead, 
the data and theory are normalized to the total Z boson cross-section in this mass 
region, the result is a significant improvement in the agreement, as shown in Fig. 9.31 
(right). There is a tension between the data cross-section and the theory prediction, 
also observed to some extent in Fig. 9.25 and Fig. 9.26. 

The transverse momentum distributions have also been measured, and compared 
to NNLOJET, in various mass and rapidity bins. More details can be found in Ref. [52] 
and Ref. [565]. 


9.3.4 Vector bosons plus jets 


The kinematic reach of the LHC allows for the measurement of W and Z bosons with 7 
or more jets. Perturbative QCD theory has reached the point where NLO predictions 
are available for the production of W + 5 jets and Z + 4 jets [232, 644], and the cross- 
sections for both W and Z+ > 1 jet have been calculated to NNLO [269, 274, 276- 
278, 564]. Jet transverse momenta of 1 TeV or greater have been measured so far, a 
region where both QCD and EW higher-order effects become important. 

ATLAS has measured W and Z boson plus jets production using the anti-kr jet 
algorithm, with a jet size of 0.4 and a jet transverse momentum threshold of 30 GeV [20, 
44]. Comparisons to the data are performed using dressed leptons and with parton- 
level predictions being corrected for the effects of fragmentation and underlying event 
energy. The inclusive jet multiplicity distribution for W+jets production in ATLAS is 
shown in Fig. 9.32 compared to predictions at LO and NLO. BLACKHAT +SHERPA, 
which provides NLO predictions for all jet multiplicities up to 5 (for W+jets), is in 
good agreement with the data, along with MEPS@NLO, which has NLO information 
for up to 2 jets. HEJ, which is based on an all-orders summation of terms describing at 
least two well-separated jets, provides good agreement for jet multiplicities of 2, 3 and 


582 Data at the LHC 


101 NNLOJET pp7Z+20jet vs=8 TeV 10" NNLOJET pp Z+20jet vs=8 TeV 
r r T r r T 
40° ATLAS Data —=— ] 102 E ATLAS Data —— ] 
NNLO —— NNLO —— 
101 No — ] F 493 NLO — ] 
A 8 
— -2 = 4 4 
z 10 $ 10 
N ~ 
$ 10° © 10° 4 
2 © 
b a 
-4 L -6 4 
= 10 NNPDF 3.0 ka 10 NNPDF 3.0 
pPł>20GeV ly41<2.4 pe>20GeV ly% <2.4 
105 F 66 GeV < mı < 116 GeV 1075 66 GeV < mı < 116 GeV 
108 i 108 
1.37 = 
[e] O tap 
a a 
= = 
2 244 
2 2 
T T 
£ € 1.0 
0.9 
1 1 1 1 1 
50 100 500 50 100 500 
pf [GeV] pf [GeV] 


Fig. 9.31 The absolute ATLAS Z boson transverse momentum distri- 
bution at 8 TeV is compared to the predictions of NNLOJET. On the 
right, the normalized data is compared to the normalized prediction from 
NNLOJET. The data are from Ref. [52]. Reprinted with permission from 
Ref. [565]. 


4. ALPGEN +PYTHIA and SHERPA provide good agreement for up to 4 jets in the final 
state, but the predictions from the two programs diverge for higher jet multiplicities. 

The measured transverse momentum distribution for the lead jet for W+ > 1 
jet events is shown in Fig. 9.33, compared to a number of theoretical predictions. 
It is noticeable that the BLACKHAT +SHERPA predictions undershoot the data at 
high transverse momentum. In this region, significant contributions are expected from 
processes such as qq —> qqW, basically dijet production with a W boson emitted from 
a quark line. This process grows with the transverse momentum of the jets primarily 
because W emission becomes competitive with hard gluon emission when the jet pr 
is much larger than the W boson mass. 

The LoopSim [757] (cf. Section 3.4.2) and BLACKHAT +SHERPA exclusive sums [134] 
predictions include more contributions from such final states, but these can be seen 
to have little impact. Note that EW corrections at 1 TeV are negative, which would 
increase the size of the discrepancy. SHERPA and ALPGEN +PYTHIA each provide bet- 
ter agreement with the higher pr range, but with the larger theoretical uncertainties 
(not shown) inherent with LO predictions. The prediction from MEPS@NLO, which 
includes NLO information for W + 1,2 jets, is still below the data at high transverse 
momentum, but closer than BLACKHAT +SHERPA. A similar effect, albeit limited to 
somewhat smaller pr can be seen with Z+jets in ATLAS [20]. However, the situation 
is not as clear for W/Z+jets measurements in CMS [660, 668]. 

The agreement for the inclusive lead jet transverse momentum distribution is better 
when compared to the NNLO theory prediction from Ref. [276], as shown in Fig. 9.34. 
It is interesting as well that better agreement at high pr is observed with the NLO 
prediction from this paper, albeit with a large scale uncertainty. The discrepancy 
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Fig. 9.32 The cross-section for the production of a W boson plus jets as a 
function of the inclusive jet multiplicity. The statistical uncertainties for the 
data are shown by the vertical bars, and the combined statistical and sys- 
tematic uncertainties are shown by the black-hashed regions. The data are 
compared to predictions from BLACKHAT +SHERPA, ALPGEN +PYTHIA, 
SHERPA, and MEPS@NLO. The left-hand plot shows the differential cross— 
sections and the right-hand plot show the ratios of the predictions to the 
data. Reprinted with permission from Ref. [44]. 


between the two NLO predictions may be due to different forms for the central scale 
used in the two theory calculations, an indication that the optimal choice of scale can 
often be difficult. 

Similar comparisons are shown in Fig. 9.35 for the exclusive final state in which only 
one jet is present (above the jet pr threshold of 30 GeV). The transverse momentum 
range is more limited than for the inclusive case as the production of a very high pr 
jet, and no other jets, is very strongly Sudakov-suppressed. In contrast to the inclusive 
case, the BLACKHAT +SHERPA prediction is in very good agreement with the data. 
This is somewhat of a surprise, as the presence of two disparate scales (the pr of the 
lead jet compared to the jet pr threshold of 30 GeV) should lead to the presence of 
large logarithms, which should spoil any agreement with the fixed-order prediction. 
Here again, though, note that the EW corrections are even larger (and negative) for the 
exclusive case than for the inclusive case. A similar level of agreement was observed for 
the ATLAS Z +1 jet exclusive jet pr distribution. This mystery was solved in Ref. [273], 
where it was shown that the ATLAS analysis removed only the jet (and not the event) 
in the situation where there is an overlapping jet and a lepton (within AR < 0.4). 
Thus, an event classified as a W/Z plus exactly one jet event may also have a second 
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Fig. 9.33 The cross-section for the lead jet transverse momentum for 
W+ > 1 jet events, with theory comparisons as in Fig. 9.32. The theoretical 
predictions have been scaled to the data to allow for easier comparisons of 
the shapes. Reprinted with permission from Ref. [44]. 


jet, in close proximity to a lepton, a configuration which has a collinear enhancement. 
This feature of the analysis (and of similar analyses in ATLAS) has since been removed. 

The cross-section for the fifth leading jet transverse momentum is shown in Fig. 9.36. 
Of course, the dynamic range is smaller than for the leading jet pr, but it is evident 
that the NLO predictions describe the data well even for a 2 — 6 process. Finally, 
the Hr distribution for events with W+ > 1 jet is shown in Fig. 9.37. As a reminder, 
Hr is the scalar sum of the transverse momenta of all jets and leptons (including the 
missing transverse momentum from the neutrino) in the event. Here the NLO predic- 
tions from BLACKHAT +SHERPA are a factor of two below the data for Hr values of 
the order of 2 TeV. For high Hr, as for high p3*"', the dominant sub-process becomes 
qq — qqW, where a W boson is emitted off Gia one of the quark lines. This subprocess 
is present as a real correction to W+ > 1 jets at NLO. Formalisms that include the 
virtual corrections for W+ > 2 jets, such as BLACKHAT +SHERPA exclusive sums, or 
LoopSim, reduce, but do not eliminate the discrepancy. MEPSQ@NLO, which has the 
one and two jet matrix elements at NLO included, provides a good description over 
the full dynamic range. Better agreement with the data is also seen using the NNLO 
calculation from Ref. [276], as observed in Fig. 9.38. The NNLO W+ > 1 jet prediction 
naturally includes the W+ > 2 jet cross-section at NLO. 

The exclusive jet multiplicity distribution is shown for Z+jets in ATLAS in Fig. 9.39 
(left) [20]. Similar to the W+jets analysis, a transverse momentum cut of 30 GeV 
and an absolute rapidity cut of 4.4 have been applied. The exclusive jet multiplicity 
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Fig. 9.34 The cross-section for the lead jet transverse momentum for 
W+ > 1 jet events, with theory comparisons from Ref. [276]. Reprinted 
with permission from Ref. [276]. 


distribution for Z+jets (as for W+jets) follows a staircase pattern indicative of Berends 
scaling, as discussed in Sections 4.1 and 4.3. This scaling is a result of the same pr 
cut being applied to every jet, and to the non-Abelian nature of the gluon branching 
process, i.e. each final state gluon itself carries a colour charge and can itself radiate. 
If instead, there is a difference in the transverse momentum threshold between the 
leading jet in the event, and any additional jets, a Poisson-type behaviour will instead 
be evident. In Fig. 9.39 (right) ,the lead jet pr is required to be greater than 150 GeV, 
while still retaining a cut of 30 GeV on the other jets. The result is described well by 
a Poisson scaling, 
o(Z+(n+1) jets) ñ 
o(Z+njets) n 


(9.7) 


>) 


with an expectation value n = 1.04+0.04. Note that there is basically no suppression 
for the second jet emission, given the core process of Z+jet, with the jet having a 
transverse momentum greater than 150 GeV. For both situations, the data are well- 
described by the theoretical predictions. 


9.3.5 Vector bosons plus heavy flavours 


The case where a vector boson is produced in association with one or more jets that 
originates from a heavy quark, either a b- or a c-quark, is especially interesting. Such 
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Fig. 9.35 As for Fig. 9.33, but for the lead jet transverse momentum in 


W+ exactly one jet events. Reprinted with permission from Ref. [44]. 
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Fig. 9.36 As for Fig. 9.33, but for the fifth jet pr in W+ > 5 jet events. 
Reprinted with permission from Ref. [44]. 
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Fig. 9.37 As for Fig. 9.33, but for the Hr distribution in W+ > 1 jet 
events. Reprinted with permission from Ref. [44]. 


processes are important as backgrounds for other physics studies, such as associated 
Higgs boson production (V H), where the Higgs boson decays into a bb final state, for 
single top production [14] or for searches for physics beyond the Standard Model [23]. 
However, they are also interesting from the standpoint of perturbative QCD, since 
the presence of a heavy quark mass scale introduces additional complications into the 
calculation. As in the case of single top production, cf. Section 4.6, the calculation may 
either be performed in a 4-flavour scheme (in which the only active quark flavours are 
u, d,c,s) or the 5-flavour scheme, in which the b parton is also present in the initial 
state. As an example, consider the Born-level predictions for the production of a final 
state containing a W-boson and at least one b-jet. In the 4-flavour scheme this can 
be described by the processes qg > Wbb and gq > Wbbq, where the presence of 
the initial-state gluon in the latter is important at the LHC. In the 5-flavour scheme 
this second process is replaced by bg > Whq. The advantages and drawbacks of each 
scheme have been summarized in Section 4.6. 

Such processes have been measured at the TEVATRON by both the CDF [62] and 
DØ [92] collaborations. The CDF collaboration found a result for the W + b jet cross- 
section larger than the SM prediction, while DØ measured a cross-section smaller 
than the SM prediction, although both were consistent within the quoted theoretical 
uncertainties. Such measurements have also been carried out at the LHC [11, 381].° 
ATLAS, for example, has measured the fiducial W + b jet cross-section, as a function 
of the jet multiplicity and as a function of the b — jet transverse momentum [19]. 


Similar results are also available for Z + b jet cross-sections [26, 367, 372, 382]. 
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Fig. 9.38 The cross-section for the Hr (also known as Sr) distribution 
for W+ > 1 jet events, with the theory comparisons from Ref. [276]. 
Reprinted with permission from Ref. [276]. 


The results are reported either with the single-top contribution subtracted, or not- 
subtracted. They have the same (inclusive) final state, but single top production has 
different kinematics, so can be separated. 

As for other measurements involving W boson decay, a high pr isolated lepton 
(either a muon or electron) is required, along with a requirement on a minimum 
missing transverse energy. The analysis requires one or more jets, with one (and only 
one) jet being tagged as a b-quark jet. It is necessary to veto on events with two 
or more b-tagged jets to reduce the background from the (sizeable) tt cross-section. 
Jets are reconstructed with the anti-kr (D = 0.4) jet clustering algorithm and a jet 
threshold of 25 GeV. The jets are required to have a rapidity |y| < 2.1, so that the 
jet lies within the tracking (and thus b-tagging) region, and any jets within a distance 
AR = 0.5 of the lepton candidate are removed. Jets are tagged as originating from 
b-quarks using a combination of two tagging algorithms. The first algorithm involves 
either explicit reconstruction of a secondary vertex consistent with originating from a 
b-quark decay. The second calculates the impact parameter significance of each track 
within the jet to determine the probability of the jet being a b-quark jet. 

The measurement of this process has backgrounds, both from processes where a 
real b is present in the final state (single-top, tt, and multi-jet), and from processes 
where a jet has been mis-tagged as a b-jet (such as W + c — jet and W + light 
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Fig. 9.39 Ratios of exclusive jet multiplicity distributions for Z+jets in 
ATLAS with (left) the standard selection cuts, and (right) requiring at 
least one jet with transverse momentum above 150 GeV. Reprinted with 
permission from Ref. [20]. 


jets). The backgrounds can be largely determined from the data itself, either by the 
presence of different kinematics (for the first category of backgrounds), or from different 
characteristics of the b-tagged jets (for the second category). For example, in the 2-jet 
bin, the contributions from W + b and single-top processes are comparable. However, 
since most of the single top events have a relatively narrow (W, b—jet) mass distribution 
around the top quark mass, this distribution can be used to discriminate the two 
processes. 

The measured (unfolded) cross-section for the production of a W boson and a b-jet 
is shown in Fig. 9.40 as a function of the jet multiplicity. Results are shown for the 
electron, muon and combined (electron+muon) channels, and comparisons are made 
to theoretical predictions from fixed order (MCFM) and parton shower Monte Carlo 
predictions (POWHEG +PYTHIA and ALPGEN +HERWIG). The Monte Carlo predictions 
use the 4-flavour scheme, while the MCFM prediction includes higher-order corrections 
from the 5-flavour scheme, i.e. allowing b-partons in the initial state. The fixed-order 
predictions have been corrected for non-perturbative effects and for double-parton 
scattering, where a W boson, and a bb final state, are produced in two separate proton- 
proton collisions. The latter cannot be ignored for this measurement at the LHC, and 
amounts to a 25% correction to the total cross-section, concentrated in the lowest pr 
bins. The kinematics of this process are complex and are reflected in the choice of the 
central scale for the theoretical calculations, 
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Fig. 9.40 The measured W + b-jet cross-section is compared to several 
theoretical predictions, in the one-jet, two-jet and one-jet+two-jet bins. 
Reprinted with permission from Ref. [19]. 
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Somewhat conservatively, the scale is varied by a factor of four about the central scale, 
rather than the traditional factor of two. 

The results for the 1-jet bin are slightly higher than the theoretical predictions (but 
within the combined experimental and theoretical uncertainty bands) while the results 
for the 2-jet bin are in good agreement with the theory. The differential cross-sections 
are shown in Fig. 9.41, for the 1-jet (left) and 2-jet (right) bins. In both cases, the 
agreement of data with theory worsens as the b-jet transverse momentum increases. 

CMS has measured the Wb final state in which both jets are required to be tagged 
as b-jets [381]. There the results, over a similar phase space as the ATLAS measurement 
(except for the requirement of 2 b-tags rather than a restriction to one and only 1), are 
in good agreement with the SM prediction. The AR distribution between two b-jets, 
for the process Z+ > 2 b jets, has been measured in Ref. [26], and is shown in Fig. 9.42, 
compared to a variety of predictions. Good agreement is observed with the fixed-order 
prediction from MCFM, with SHERPA and with aMC@NLO, in the 4-flavour scheme. It 
is noteworthy, however, that no prediction describes the first bin well, when the two 
b-jets approach collinearity,indicating perhaps a difficulty in describing the collinear 
splitting of a gluon into a bb final state. This was also a difficulty at the TEVATRON, 
and could have implications for other final states involving bb pairs at the LHC, such 
as for associated Higgs boson production, especially in the boosted region. 
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Fig. 9.41 The measured W + b-jet differential jet cross-section, compared 
to theoretical predictions from ALPGEN and MCFM, in the one-jet (left) 
and two-jet (right) bins. The MCFM cross-sections have been corrected for 
non-perturbative effects. The ALPGEN prediction has been interfaced to 
HERWIG and JIMMY and has been scaled to the inclusive NNLO W cross- 
section. The ratios between the measured and predicted cross-sections are 


also shown. Reprinted with permission from Ref. [19]. 


9.3.6 Single photons 


At the LHC, in contrast to the TEVATRON, the single inclusive photon production 
process is dominated by the gq initial state for moderate to high Er photons. Thus, a 
precision measurement of the process provides information on the gluon distribution 
complementary to that provided by the inclusive jet cross-section. The energy and the 
direction of the photon can be measured very well (better than that for a comparable 
jet); thus, in principle the photon cross-section can be measured with greater precision 
than the jet cross-section. Photon production does suffer, however, from backgrounds 
resulting from jets that fragment into one or more mesons (such as neutral pions) 
carrying a large fraction of the parent jet’s momentum, as discussed in Section 8.4. 
As at the TEVATRON, this background can be reduced by imposing an isolation cut, 
and quality identification cuts, on the photon candidates. The impact of the isolation 
cut is not only to reduce the background due to jets, but also to reduce the photon 
production cross-section due to fragmentation processes. 

In ATLAS [30, 51], an isolated photon is defined as one in which there is a restricted 
amount of additional energy around the photon candidate in a cone of radius R = 0.4.10 
The energy in the isolation cone has already been corrected for both the underlying 
event and the effects of multiple interactions. In Fig. 9.43 is shown (left) the energy 
distribution in the isolation cone for tight and non — tight photon candidates in the 


10For example, for the 8 TeV analysis, the requirement on the isolation energy is: Ege < 4.8 GeV+ 
-3 pT 
4.2 x 107° Ep. 
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Fig. 9.42 The measured AR distribution between 2 b jets, in Z+ > 2 b 
jet events, compared to a variety of predictions. Reprinted with permission 


from Ref. [26]. 


7 TeV data from Run I. The non-tight photons fail the tight identification criteria in one 
or more categories, and thus are likely to be produced as a result of jet fragmentation. 
In general, the non-tight photon candidates are less isolated than the tight ones. By 
determining the fraction of loose photons present in the tight isolation region, the 
photon backgrounds can be determined for each kinematic bin. The resulting photon 
purity is shown in Figure 9.43 (right) for two rapidity intervals in the 7 TeV analysis. 
The imposition of the isolation cut results in a high photon purity, which approaches 
100% for high photon transverse energy. As stated in Section 8.4, an isolated high 
transverse energy photon candidate is much more likely to be a true photon, than to 


15 2 25 3 35 4 45 5 
AR(b,b) 


be a product of jet fragmentation. 


The resulting inclusive photon cross-sections for a centre-of-mass energy of 8 TeV 
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Fig. 9.43 Left: the transverse energy in the isolation cone for photon can- 
didates (both tight and non-tight) for photon transverse energies above 
100 GeV. The non-tight distribution has been normalized to the tight dis- 
tribution in the (background-rich) region above 15 GeV. Right: the result- 
ing photon purity is shown as a function of the photon transverse energy 
in the ATLAS barrel and end-cap regions. The shaded bands indicate the 
statistical uncertainties. Reprinted with permission from Ref. [30]. 


are shown in Fig. 9.44 for four rapidity intervals. A large dynamic range is represented 
in this plot, from 30 GeV to over 1 TeV. There was a considerable reduction in the 
size of the experimental systematic errors from the 7 TeV analysis to the 8 TeV 
analysis. The data is systematically below the NLO predictions from JETPHOX [348] 
for transverse momenta below 500 GeV. A somewhat better agreement with the data 
is achieved with the PeTeR prediction [208]. Here, the calculation is again carried out 
at NLO, but in addition threshold logarithms are resummed at next-to-next-to-leading 
accuracy. Note that since the calculations are at NLO, the scale uncertainty is still 
sizable. The uncertainty will be reduced once a NNLO calculation for inclusive photon 
production is completed, which will add to the attractiveness of the process as an 
input into global PDF fits.'! 

Similar methods have been used by CMs [359, 384] for measuring the isolated inclu- 
sive photon cross-section, with similar results obtained. More potential information on 
the PDF's of the colliding protons can be obtained by measuring the distribution of the 
accompanying jet, in addition to the photon, at the cost of a somewhat less-inclusive 
theoretical prediction [13, 384]. As at the TEVATRON, the inclusive photon+jet data 
are very useful for the calibration of the jet energy scale. 


11Late in the editing of this book, the NNLO calculation has in fact been completed, in Ref. [316]. 
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Fig. 9.44 The ATLAS isolated photon cross-section, for four rapidity re- 


gion. The inner error bars on the data points indicate statistical errors 
only, while the outer error bars show the statistical and systematic errors 
added in quadrature. The error band given is that from the PeTeR calcula- 
tion,and corresponds to the combination of the scale, PDF and electroweak 
undertainties. Reprinted with permission from Ref. [51]. 


9.4 Vector boson pairs 
9.4.1 Diphotons 


One of the key Higgs boson final states at the LHC is the decay into two photons. The 
Higgs boson diphoton signal is typically swamped by the much larger QCD diphoton 
production rate, but with fine enough diphoton mass resolution, the presence of a 
narrow Higgs boson resonance can be detected (and indeed has been). Nevertheless, 
it is important to understand QCD production of diphotons, especially as approxi- 
mately half of the production proceeds through gg scattering, the same initial state 
that dominates Higgs boson production. A cut is placed on the transverse energy of 
the leading (second leading) photon of 25 GeV (22 GeV). The slight asymmetry helps 
to reduce any instability in the higher-order calculations. As for single photon pro- 
duction, backgrounds due to jet fragmentation are suppressed by the imposition of an 
isolation cut, similar to the one used for the inclusive photon measurement. This iso- 
lation cut also suppresses production mechanisms where one or both photons results 
from photon fragmentation from a quark line. Results from an ATLAS measurement of 
diphoton production at 7 TeV [18] are shown in Fig. 9.45 for the diphoton transverse 
momentum. As at the TEVATRON, there is a shoulder (the Guillet shoulder) at a trans- 
verse momentum of approximately 50 GeV, corresponding to events that also have a 
low azimuthal separation. The agreement with the DIPHOX +GAMMA2MC prediction is 
poor for these two variables, but much better for the 2yNNLO and SHERPA predictions. 
The key aspect for both of the latter calculations is the presence of tree-level 2 > 4 
processes, needed to describe the two observables. 
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Fig. 9.45 The ATLAS diphoton transverse momentum data compared to 
predictions from DIPHOX +GAMMA2MC, 27NNLO and SHERPA. The error 
bars correspond to the total experimental uncertainties, which are dom- 
inated by the systematic errors. Reprinted with permission from from 
Ref. [18]. 


The diphoton mass distribution is shown in Fig. 9.46. The agreement with the 
2YNNLO prediction is good for the entire mass range; there is a disagreement with 
the DIPHOX prediction at low diphoton masses, where the 2 — 4 subprocesses are 
important, and to a lesser extent for masses of a few hundred GeV, where NNLO 
corrections may be important. 

Similar results have been obtained by CMS [378]. 
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Fig. 9.46 The ATLAS diphoton data compared to predictions from 
DIPHOX +GAMMA2MC, 2yNNLO and SHERPA. The error bars correspond 
to the total experimental uncertainties, which are dominated by the sys- 
tematic errors. Reprinted with permission from Ref. [18]. 


9.4.2 Dibosons 


The measurement of diboson final states provides an important test of the non-Abelian 
nature of the Standard Model, and a sensitivity to anomalous triple gauge boson 
couplings. In addition, the WW and ZZ final states are important for the measurement 
of the Higgs boson decays into those two channels. There is a large background for 
the measurement of WW production from tt production; thus, commonly a jet veto 
requirement is applied to reduce the latter. Cross-sections then are commonly corrected 
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for the geometric and kinematic acceptances as well as for the impact of the jet veto 
to obtain a fully inclusive cross-section [21, 53, 375, 673], more easily compared to 
theoretical predictions. The imposition of a jet veto restricts the phase space for gluon 
emission and thus results in an increased uncertainty in the predicted cross-section. 
However, in the case of diboson production, the scale dependence at NLO is inherently 
small, and thus the increased uncertainty for the vetoed cross-section is still less than 
the experimental systematic uncertainty, and may not represent the full theoretical 
uncertainty. 

WW production is typically measured in the final state where both W bosons 
decay into a lepton (electron or muon) and a neutrino. Thus, the signature consists 
of two high pr leptons and a substantial amount of missing transverse energy. The 
dominant sub-process is gg > WW, with much smaller contributions (approximately 
10% total) from gg > WW and gg => H > WW. The main backgrounds come from 
Drell-Yan, top (tt and single top) production, W+jets, and diboson (WZ, ZZ, Wy) 
production. The top background is suppressed by the requirement that there can be 
no jets with transverse momentum above 25 GeV within a rapidity interval of +4.5.!? 

The fiducial cross-section is corrected for identification and isolation requirements 
and detector resolution effects. The total WW cross-section is calculated by correcting 
the fiducial cross-section for the extrapolation to the full WW phase space. A small 
excess with respect to the NLO predictions is observed by ATLAS, with similar results 
obtained in CMS. There has been speculation that this excess might be the result of 
new physics [423, 424, 522, 647, 828]. However, there are a few caveats. The WW 
cross-section has recently been calculated to NNLO, resulting in an increase over the 
NLO result of the order of 10% [557], thus significantly decreasing the excess. (A 
further 2% increase in the theoretical prediction results from a 2-loop calculation of 
the subprocess gg — WW [324].) In addition, Ref. [768] points out that the ATLAS 
fiducial cross-section is in agreement with the theoretical prediction for the same, and 
the disagreement for the total cross section results from the extrapolation to the full 
phase space using the POWHEG box. The Monte Carlo result for the jet-veto efficiency 
overestimates Sudakov suppression effects with respect to a calculation using analytic 
resummation. This is one of the perils of comparisons at the fully inclusive (corrected) 
level, compared to fiducial comparisons. 

A comparison of the NLO and NNLO predictions, and of ATLAS and CMS data, 
for WW production from Ref. [557] is shown in Fig. 9.47. 

Other diboson final states (for example ZZ, WZ, Wy, Zy) have also been mea- 
sured by both ATLAS and CMS [22, 41, 375, 665]. A summary of CMS results is shown 
in Fig. 9.48, where good agreement with NLO and NNLO Standard Model predictions 
is observed. Differential distributions can be used to place limits on anomalous cou- 
plings. For example, in Fig. 9.49 are shown (left) the unfolded transverse momentum 
distribution for the leading Z boson and (right) the four-lepton reconstructed mass 
distribution, both from Cms [664]. The presence of anomalous triple gauge couplings 
would manifest itself as deviations from the Standard Model predictions at high Z 
pr/ high four-lepton mass. In both cases, good agreement with the Standard Model 
predictions is observed. 


12These specific cuts are for ATLAS but are similar for CMS. 
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Fig. 9.47 The ATLAS and CMS WW cross-sections compared to the pre- 
dictions at NLO and NNLO from Ref. [557]. Reprinted with permission 
from Ref. [557]. 


9.4.3 Vector boson backgrounds for BSM searches 


The Standard Model has been extremely successful at LEP, HERA, the TEVATRON 
and now at the LHC. However, as discussed earlier, the SM is incomplete, and one 
of the main goals of the LHC is the discovery of new physics beyond the current 
paradigm. Such Beyond-the-Standard Model (BSM) physics is mostly expected at high 
mass scales, where the energies of previous colliders would not have been sufficient to 
discover it already. The signatures of BSM physics are mostly comprised of the same 
observables considered in this chapter: photons, leptons, jets (with or without b-tags) 
and missing transverse energy, but with cuts appropriate to the expected higher mass 
scale. Often the dominant contribution to these final states comes from the production 
of vector bosons, either singly or in pairs, plus jets. In that case the measured SM 
cross-sections can serve to determine the backgrounds to new physics processes, for 
instance by extrapolating to new kinematic regions. The possibility that new physics 
(such as stop pair production) could be hiding in the WW cross-section measurement, 
for final states involving two leptons and large missing transverse energy, was already 
mentioned in Section 9.4.2. 

As an example, consider a BSM search performed by CMS using the full 2012 data 
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Fig. 9.48 Comparison of CMS measurements of diboson cross-sec- 
tions at 7, 8 and 13 TeV with NLO and NNLO predictions, 
from twiki.cern.ch/twiki/pub/CMSPublic/PhysicsResultsCombined/. 
Reprinted with permission from CERN. 
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Fig. 9.49 (left) A comparison of the (normalized) leading Z boson trans- 
verse momentum distribution from CMS to a NLO prediction from MCFM. 
(right) A comparison of the (normalized) four-lepton mass distribution 
from CMS to a NLO prediction from MCFM. Reprinted with permission 
from Ref. [664]. 
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sample at 8 TeV, targeting new physics in multi-jet final states [386]. To focus on the 
region most sensitive to BSM signals, the search requires both large amounts of total 
transverse energy of the jets (Hr) and missing transverse energy Mr. Specifically, 
the search region is defined by the requirements that there be three or more jets, 
with pr > 50 GeV and |n| < 2.5, a total transverse energy sum of the jets greater 
than 500 GeV, and a missing transverse energy greater than 200 GeV. This final 
state is sensitive to the production of pairs of squarks and gluinos, where the squarks 
(gluinos) each decay into one (two) jets and a lightest supersymmetric partner (LSP). 
There are substantial contributions to this final state from Z(— vi) + jets and from 
W(—> év)+jets (and tt production), when an electron or muon is lost or when a 7 lepton 
decays hadronically. The cross-section for Z(— viv) + jets is estimated using the larger 
measured cross-section for y+jets, correcting for the electroweak coupling differences, 
and making use of the similar kinematic properties exhibited by the two processes. To 
reduce the backgrounds from W-+jets and tt processes, events with isolated electrons 
or muons with transverse momentum greater than 10 GeV are vetoed. The surviving 
events are those in which any leptons escape detection, and thus a good knowledge of 
the lepton reconstruction efficiency is necessary to accurately predict this background. 

In addition the analysis is subdivided into bins that correspond to the number 
of jets in the final states, 3-5 jets, 6-7 jets and 8 or more jets, in order to retain 
sensitivity to possible longer cascades of squark and gluino decays. The CMS data is 
shown as a function of Ar for one of the jet bins in Figure 9.50, and compared to the 
predicted backgrounds and possible signals for several squark and gluino production 
and decay modes. The number of events observed in the data is consistent with the 
number expected from SM background processes. For low jet multiplicities the primary 
background for high values of Hy is from Z(— vi) + jets events, whereas in the 
highest jet multiplicity bin the largest background comes from W-+jets and tt events. 
Smaller backgrounds result from QCD multi-jet production, where the large Hr is 
produced primarily from heavy flavour decays inside the jets or from jet energy mis- 
measurement. This background is more significant (but still sub-leading) for the higher 
jet multiplicity bins. The larger the number of jets, the greater is the chance that one 
or more jets will contribute significant Hr due to the causes mentioned above. 


9.5 Tops 
9.5.1 Distributions: top-pairs (plus jets) 


Measurement of the top pair cross-section at the LHC allows for precision tests of QCD, 
particularly with the theoretical prediction now known to NNLO. The top pair cross- 
section is significantly larger at the LHC than at the TEVATRON, partially because of the 
dominance of the gg initial state for top production at the LHC, and the rapid increase 
of the gg PDF luminosity with energy, as discussed in Chapter 6. A comparison of 
the cross-section measurements at the TEVATRON and at the LHC (7 and 8 TeV) is 
shown in Fig. 9.51. Top pair production can be measured in a number of final states, 
depending on the decay modes of the W bosons that are produced. The most useful of 
these states have at least one leptonic W-decay, with any leptons produced at high-pr 
and well-isolated. In the dilepton mode the two leptons are of opposite charge and 
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Fig. 9.50 The observed Hr distributions from CMS for events with 
Hr > 500 GeV and jet multiplicities from 35. The data are compared 
to the SM backgrounds and the predictions for several different SUSY sce- 
narios. Reprinted with permission from Ref. [386]. 


are accompanied by two jets. In the case where one W-boson decays hadronically, the 
final state corresponds to a single lepton and three or four jets. The jets should have 
transverse momenta above 25 — 30 GeV and at least one of them should be tagged 
as a b-jet. The jet threshold is higher at the LHC than the TEVATRON because of the 
larger underlying event, as well as the greater pileup through much of the running at 
7 and 8 TeV. The better tracking detectors in ATLAS and CMS than at the TEVATRON 
have resulted in higher b-tagging efficiencies, typically of the order of 80%, compared 
to the 50-60% efficiencies for CDF and D@. ATLAS and CMS both use the anti-kr jet 
algorithm for jet reconstruction, with jet sizes of 0.4 and 0.5 respectively used for the 
two experiments (at 7 and 8 TeV). The cross-sections determined from the different 
final states agree with each other, as do the results from the two experiments. The 
experimental results also agree well with the NNLO+NNLL predictions of Ref. [427]. 
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Fig. 9.51 A compilation of top pair production cross-sections 
as a function of centre-of-mass energy from the LHCTopWG 
(twiki.cern.ch/twiki/bin/view/LHCPhysics/LHCTopWG) compared to 
the NNLO+NNLL predictions of Ref. [427]. Reprinted with permission 
from CERN. 


It is notable that, despite the impressive precision of the theoretical prediction, the 
experimental errors are smaller still. 

The top mass can be measured as well from the same final states used in the cross- 
section measurements. A compilation of the top mass measurements at ATLAS and 
CMS compared to the TEVATRON average, the LHC average, and the world average, 
is shown in Fig. 9.52. The precision at the LHC is still not at the same level as the 
final measurements from the TEVATRON, but it should surpass the latter in Run 2. 
Theoretical issues (such as recombination effects and uncertainties as to what exactly 
the measured top mass represents), discussed in Section 4.5, will now have to be 
addressed. 

Differential measurements of tt final states allow for additional precision tests of 
perturbative QCD, probes of high mass regions sensitive to new physics, and more de- 
tailed information on tt kinematics useful for PDF determination. Differential predic- 
tions for tt production at NNLO have been recently calculated [426, 429]. Differential 
measurements have been performed for such variables as the tt mass, the tt rapidity 
distribution, the transverse momentum of the top quark, and the tt transverse momen- 
tum distribution (as well as others). In Ref. [56], the experimental results have been 
unfolded both to a fiducial particle-level phase space and to a fully-corrected phase 
space. The former has less of a model dependence, and thus smaller uncertainties, and 
the latter is often more appropriate for comparison to higher-level predictions. 
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Fig. 9.52 A compilation of top mass determinations from 
ATLAS and CMS compared to the LHC and world averages, from 
atlas.web.cern.ch/Atlas/GROUPS/PHYSICS/CombinedSummaryPlots/TOP/. 
Reprinted with permission from CERN. 


The tt mass and rapidity distributions, corrected to the full phase space, are shown 
in Fig. 9.53 for the ATLAS 8 TeV measurements. (Results at 7 TeV for ATLAS can be 
found at [40] and 7 and 8 TeV results for CMS can be found at [371] and [662]. It is 
noteworthy that ATLAS and CMS have adopted the same binning for the fully-corrected 
distributions, allowing for easier future combinations.) The data are compared both 
to a POWHEG +PYTHIA 6 prediction and to the NNLO differential (fixed-order) pre- 
diction, both using the MSTW2008 NNLO PDFs. Good agreement with the data 
is observed for both predictions for the mz distribution, while the NNLO prediction 
agrees better with the y;z data. In general, the agreement seems to be better for NNLO 
comparisons than for NLO comparisons, and better with the more recent PDFs than 
with previous generations. There is still a sizable PDF sensitivity, especially at high 
mz and y, paving the way for the inclusion of this data into global PDF fits. One 
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Fig. 9.53 The ATLAS tt normalized, differential cross-sections for the tt 
mass (left) and tt rapidity (right), compared to theoretical predictions 
at NNLO using the MSTW2008 PDFs. Reprinted with permission from 
Ref. [56]. 


caveat is that electroweak effects, which can be sizable, are also not yet available for 
all observables [705]. 

Measurements of the tt asymmetry at the LHC are much more difficult than at 
the TEVATRON due both to the symmetric nature of the colliding beams, and the 
dominance of the (symmetric) gg subprocess for tt production. Also, as a result of this 
symmetry, the forward-backward asymmetry measured at the TEVATRON (Ay = y+ — 
yz) is no longer a useful variable, and the charge asymmetry instead (Ay = |y:| — lyzl) 
must be measured. Perturbative QCD predicts top anti-quarks to be produced more 
centrally than top quarks. The expected value for this asymmetry is smaller than the 
forward-backward asymmetry measured at the TEVATRON. A number of measurements 
have been carried out by both ATLAS and CMS of the inclusive charge asymmetry [10, 
32, 50, 363, 365, 670, 672, 674], and the results are in agreement with the NLUO+EW 
prediction, as observed in Fig. 9.54. Measurements have also been made for high tt 
mass (mz > 600 GeV) and for boosted tt systems (Bz > 0.6) [32], where the effects 
of any new physics may be expected to be magnified [128]. No deviation from the 
SM predictions is observed for these special kinematic regions. Note that unlike the 
TEVATRON, NNLO predictions for this observable are not yet available for the LHC (at 
the time of this book). 


9.5.2 Single top production 


Single top production, measured at the TEVATRON in the s and t channels (see Sec- 
tion 8.6), is dominated by t—channel production at the LHC, as shown in Figure 9.55. 
There is a large growth in the single top cross-section from the TEVATRON to the LHC 
(8 TeV), of about a factor of 60. ATLAS and CMS have measured single top cross sec- 
tions for the t—channel [24, 657] and for the Wt final state [7, 385] while setting upper 
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Fig. 9.54 A compilation of top pair charge asymmetry mea- 
surements at the LHC compared to SM predictions, from 
atlas.web.cern.ch/Atlas/GROUPS/PHYSICS/CombinedSummaryPlots/TOP/. 
Reprinted with permission from CERN. 


limits for the s— channel contribution, which has of course already been measured at 
the TEVATRON [47]. Since the tW final state can be distinguished from the other single 
top final states, its analysis is usually conducted separately from the other two. 

As at the TEVATRON, the signature for t channel production involves the presence of 
an isolated high pr lepton, substantial missing transverse energy, and two or three jets, 
with at least one jet tagged as a b-jet and with one jet at high rapidity. The rapidity 
distribution for the (untagged) forward jet serves as a good discriminant for separating 
t-channel single top production from the s-channel mode and other SM backgrounds. 
Measurements of the t-channel cross-section at the LHC, and the theoretical prediction 
of NLO QCD, are shown in Fig. 9.56. The cross-sections measured by ATLAS and CMS 
are in good agreement both with each other and with the theoretical prediction at 
this order. Although the NNLO prediction for this cross-section [288] has not been 
compared in this figure, it also agrees very well. For instance, the comparison with the 
combined results of ATLAS and CMS for the (t + t) single top t-channel process is, 


o“TLAs+coms = 85 +12 pb, ONNLO = 83.9755 pb. (9.9) 
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Fig. 9.55 A compilation of single top production cross-sections 
from ATLAS and CMS as a function of the centre-of-mass energy, from 
atlas.web.cern.ch/Atlas/GROUPS/PHYSICS/CombinedSummaryPlots/TOP/. 
Reprinted with permission from CERN. 


The difference between the two uncertainties underscores both the difficulty of mea- 
suring this process at the LHC and the precision of the NNLO calculation. 

Due to the LHC being a pp collider, the production of single top quarks is larger than 
that of single anti-top quarks. The NNLO prediction for the ratio of t/t production is 
1.825 + 0.001. This ratio is sensitive to the distribution of up and down type quarks 
in the proton, as well as possible new physics that may couple to the Wtb vertex. 
The ratios measured by ATLAS and CMS are consistent with the Standard Model 
predictions, albeit with relatively large statistical and systematic uncertainties, as 
observed in Fig. 9.57. If one assume that no anomalous form factors are present at the 
Wtb vertex it is possible to extract the size of the (t,b) CKM matrix element, |Vj,]. 
ATLAS and CMS measure 1.02 + 0.07 and 0.998 + 0.041 respectively, where the total 
error includes both the experimental and theoretical errors added in quadrature. 


9.6 Higgs boson 
9.6.1 Introduction 
The discovery of the Higgs boson at the LHC by the ATLAS [15] and CMS [370] ex- 


periments can be considered as the culmination of the Standard Model. Because of 
its importance, a great deal of theoretical effort has been devoted towards precision 


Higgs boson 607 


ATLAS+CMS Preliminary TOPLHCWG 
Data 2012, Ys = 8 TeV 
rere NLO (MCFM), m,, = 172.5 GeV, 
PDF4LHC (MSTW2008, CT10, NNPDF2.3) 
E scale uncertainty 
scale ® PDF © a, uncertainty 


ATLAS, L. = 5.0 fb” 


int 


ATLAS-CONF-2012-132 


CMS, L, = 5.8 fb" 
CMS-PAS-TOP-12-011 


LHC combined (Sep 2013) 


ATLAS-CONF-2013-098, 
CMS-PAS-TOP-12-002 


— stat. uncertainty 
— total uncertainty 
stat) +(syst) +(lumi) 


© t-channel +( 


95.1+2.4+17.6 + 3.6 pb 


80.1+ 5.7 +11.0 + 4.0 pb 


85+ 441143 pb 


July 2014 


ATLAS, L = 20.3 fb" 
ATLAS-CONF-14-007 


CMS, L, = 19.7 fb” 


82.6 + 1.2 + 11.8 + 2.3 pb 


83.6 + 2.3 +7.14 2.2 pb 


JHEP06(2014)090 
Effect of beam energy uncertainty: 1.2 pb 
| | | | | | | 
20 40 60 80 120 140 160 
Ot-channel [pb] 
Fig. 9.56 A compilation of t-channel single top cross- 


section measurements from ATLAS 


and CMS, 


taken from 


atlas.web.cern.ch/Atlas/GROUPS/PHYSICS/CombinedSummaryPlots/TOP/. 


Reprinted with permission from CERN. 


ATLAS Ldt=4.59 fo’ Vs=7 TeV 
Measurement result 
[stat © sys. Pstat. 


ABM11 (5 flav.) 

CT10 

CT10 (+ DO W asym.) 
GJRO8 (VF) 
HERAPDF 1.5 
MSTW2008 (68% CL) 
NNPDF 2.3 


15 16 17 18 19 


CMS, Ys =8 TeV, L = 19.7 fb” 
T ji T 


CMS 
1.95 + 0.10 (stat.) + 0.19 (syst.) 
ABM11 


CT10 
CT10w 
HERAPDF 
MSTW2008 


NNPDF 2.3 


f 2 2.2 
Rech, = Spon (Vpn (0) 
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calculations of both the Higgs boson production cross-sections and the decay branch- 
ing ratios [161, 296]. Higgs boson final states involve the measurements of photons, 
leptons, jets (including b-tagged and c-tagged jets), 7 leptons, and missing transverse 
energy, i.e. the building blocks of the LHC SM measurements discussed in this chap- 
ter. The tools developed for SM measurements, both theoretical and experimental, 
can be directly adapted for measurements of the Higgs boson production and decay 
rates, and its properties. For example, for some of the Higgs boson measurements, a 
better signal-to-background ratio can be gained by requiring the Higgs boson system 
to be boosted. Boosted systems have received a great deal of attention at the LHC, as 
discussed in Section 9.2.4 and, for example, Ref. [431]. The knowledge of the SM pro- 
cesses also serves to improve the determination of the backgrounds to measurements 
of Higgs boson final states. Some of the backgrounds can be determined from the data; 
for others some dependence on theoretical predictions is necessary. 

The relative rates for the production of a 125 GeV SM Higgs were shown in 
Fig. 4.53. The dominant production mode is gg fusion for all centre-of-mass ener- 
gies. The other modes added to the discovery potential but are also important for a 
complete understanding of the Higgs boson properties. For example, the VBF process 
probes the couplings to W/Z bosons, while t#H probes the coupling to the top quark. 
The decay branching ratios have been previously shown in Fig. 4.55. For a Higgs boson 
mass of 125 GeV, the dominant decay is into a bb pair, followed closely by a decay into 
WW*. As the Higgs mass is below the threshold for WW production, one of the W 
bosons has to be off mass-shell. The decay into two photons has one of the smallest 
branching ratios, but was still important for the discovery because of the precision in 
which the 4-vectors of the two photons could be measured. With sufficiently precise 
resolution, the two photon mass peak for the Higgs boson can be discerned from the 
copious backgrounds for QCD diphoton production. To some extent, the ATLAS and 
CMS detectors were designed to optimize the search for the Higgs boson. For example, 
both experiments chose solutions for their electromagnetic calorimetry that prioritized 
precise energy resolution so as to be able to reduce the observed width of the Higgs 
boson in its two photon final state. 

The total number of inelastic pp collisions in Run 1 at the LHC was of the order of 
1.5 x 1015. In total, over 500,000 Higgs bosons were produced per experiment in these 
final states (before acceptance and reconstruction). 

The discovery of the Higgs boson (or rather of a new particle with a signature con- 
sistent with the Higgs boson, as noted in the discovery) occurred on July 4, 2012, with 
both ATLAS and CMS observing a significance for a signal at 125 GeV of approximately 
5 sigma. The discovery resulted from approximately 5 fb—! of data at a centre-of-mass 
energy of 7 TeV and approximately 6 fb~! at 8 TeV. Further data-taking increased 
the integrated luminosity at 8 TeV to over 20 fb~', allowing not only 10 standard de- 
viation evidence for the Higgs boson, but also detailed investigations of its couplings. 
For some of the final states, differential distributions were also measured. 

The discovery and first measurements of the Higgs boson in Run 1 required the 
development of sophisticated analysis techniques designed to optimize the signal-to- 
background discrimination in the various analysis channels. This optimization could be 
applied directly to cut-based analyses, or as input to multivariate analysis techniques. 
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Since the signal-to-background discrimination was based on SM theory, in some sense 
this was the discovery of the Standard Model Higgs boson. With the increased statistics 
expected in Run 2, some of this theory dependence in the analyses can be relaxed. 

The rest of this section will be as follows. In Section 9.6.2, the analyses of Higgs 
boson production in several specific channels will be discussed (diphoton, WW*, ZZ*), 
followed in Section 9.6.3 by a summary of the signal strengths for the different modes. 
In Section 9.6.4, the determination of the Higgs mass and width is discussed. Finally, 
in Section 9.6.5, differential Higgs measurements will be discussed. In most cases, the 
details of the analyses are taken from the ATLAS measurements, but similar techniques 
are used by the CMS experiment. 


9.6.2 Decay channels 
9.6.2.1 Diphoton final states 


The Higgs boson discovery potential in the diphoton channel depends critically on 
the precision with which the Higgs 4-vector can be measured. This precision depends 
both on the resolution for the photon energy determination and on the resolution for 
the determination of its direction. If the only interaction per crossing were that of the 
collision that produced the Higgs boson, then there would be no ambiguity for the 
latter, as the charged particle tracking for both ATLAS and CMS allows a precision de- 
termination of the interaction vertex. However, this becomes more problematic under 
the high pileup conditions with which the Higgs boson was discovered, as there can 
be on the order of 20 or more additional interaction vertices. The ATLAS detector was 
designed with a pre-shower detector (and fine lateral and longitudinal segmentation) 
that allows some degree of pointing of the photon back to the correct interaction ver- 
tex. The correct interaction vertex can be identified over 90% of the time for low pileup 
conditions, decreasing to the order of 70% for high pileup conditions (25 interactions 
per crossing) [29]. The Cms detector has fine lateral segmentation but only one depth 
segment (and no pre-shower detector in the central rapidity region), so the interaction 
vertex chosen is chosen using a boosted decision tree using kinematic information on 
the charged tracks from each vertex and on the diphotons in order to provide the best 
match [658]. The efficiency to determine the correct interaction vertex increases as 
the transverse momentum of the diphoton system increases, reaching above 90% for 
py > 50 GeV. Both experiments have a significant amount of material in front of the 
electromagnetic calorimeter (tracking, services, the solenoid coil in the case of ATLAS), 
so a significant fraction of the photons will have converted into electron-positron pairs. 
Algorithms then have to be used to identify those conversions and to correct the pho- 
ton energy accordingly. In general, the same reconstruction algorithms are applied for 
photons from Higgs candidates as for photons in other SM analyses. 

As for other SM measurements, the two photons from the Higgs boson decay have 
an isolation cut imposed; in the case of ATLAS, this involved a combination of the 
requirement of less than 6 GeV of energy in the calorimeters in a cone of R = 0.4 about 
the photon direction and a requirement that the sum of all charged track momenta 
within a cone of 0.2 about the photon direction be less than 2.6 GeV. The isolation 
cut is applied after the event-by-event subtraction of the underlying event and pileup 
energy, similar to what is done in the QCD diphoton measurement. The H —> yy search 
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Fig. 9.58 The ATLAS diphoton mass distribution in the Higgs search re- 
gion. The fractions of the events resulting from real diphotons, from pho- 
ton+jet events and from jet-jet events have been estimated using a double 
two-dimensional sideband method as discussed in the text. Reprinted with 
permission from from Ref. [29]. 


in ATLAS requires (at least) two photons in the (absolute) rapidity range less than 2.37 
(excluding the crack region 1.37 < |n| < 1.56 where the energy resolution is degraded). 
A requirement is made that the ratio of the photon transverse energy to the diphoton 
mass (Er/my) is less than 0.35 (0.25) for the leading (2nd leading) photon. The 
photon transverse energy cuts are larger than for the diphoton measurement discussed 
in Section 9.4.1 and more asymmetric. Such a large (asymmetric) cut emphasizes the 
Higgs boson signal over the continuum diphoton background. 

The mass spectrum for Higgs diphoton candidates with these cuts at 8 TeV is shown 
in Fig. 9.58. Notice that the real QCD diphoton signal dominates over photon-jet and 
jet-jet backgrounds, where one or more jets mimics a real photon (for example with a 
m° which takes most of the momentum of the jet). The isolation cut greatly reduces 
the background rate. The real diphoton and jet background fractions are determined 
by the use of a double two-dimensional sideband method involving (1) loose and tight 
photon identification criteria and (2) loose and tight photon isolation criteria [29]. 

As for the inclusive Higgs boson cross-section, the dominant subprocess involving a 
diphoton final state is gg fusion (87%), followed by VBF, VH (5%), and ttH (1%). As 
the signal to background ratio is small, and varies according to the Higgs subprocess, 
the diphoton events are assigned to 12 exclusive categories, with each category opti- 
mized to maximize the expected signal strength of the subprocess (for example, the 
presence of leptons for VH, two widely separated jets for VBF, etc). The expected 
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Fig. 9.59 The diphoton mass spectrum measured in ATLAS in the 7 and 
8 TeV data. Each event has been weighted by the signal to background 
ratio for the category to which it belongs. The solid red curve shows the 
sum of the signal (for a Higgs of mass 125.4 GeV) and background fits. 
Reprinted with permission from Ref. [29]. 


diphoton mass resolution also differs among the various categories. These exclusive 
categories can be combined into the four channels discussed above (gg fusion, V BF 
(7%), VH and ttH). 

For each category of event, a weight is determined based on the expected signal 
to background for that category, where the S/B ratio is determined from SM theory. 
The (S/B) weighted distribution is shown in Fig. 9.59, where a clear bump is evident 
at about a mass of 125 GeV. The width of the bump is entirely determined by the 
resolution of the photon measurements, as the intrinsic width for a 125 GeV Higgs 
boson is on the order of 4 MeV. The signal strengths for the diphoton channels are 
shown in Fig. 9.64. 


9.6.2.2 ZZ* — Al final states 


Higgs boson candidates decaying into ZZ* final states are formed by selecting two 
same-flavour, opposite-sign lepton pairs, with the dilepton mass combination closest 
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Fig. 9.60 (left) The 4 lepton mass spectrum measured in ATLAS in the 7 
and 8 TeV data. (right) A plot of the sub-leading vs leading dilepton pair 
masses, where the 4 lepton mass was required to be between 120 and 130 
GeV. The dominant probability for the leading pair to be at the Z pole 
mass can be observed. Reprinted with permission from Ref. [42]. 


to the Z mass being termed the leading dilepton pair with the second dilepton pair 
being formed from the remaining two leptons. The leading pair is required to have 
a mass between 50 and 106 GeV. Electromagnetic radiation from the leptons can 
often be measured in the electromagnetic calorimeters and used to correct the lepton 
momentum. Collinear photons are associated with muons and non-collinear photons 
are associated with either electrons or muons. Both track and calorimeter isolation 
requirements are applied to the leptons, after the event-by-event subtraction of the 
underlying event and pileup energy. 

As for the diphoton final state, the 7Z* event candidates are assigned to cate- 
gories (4 in this case; high mass 2 jets (VBF-enriched), low mass 2 jets (VH-enriched), 
additional lepton (VH-enriched), and gg fusion) in order to optimize the Higgs cross- 
section determination. For the VBF category the dijet mass is required to be above 
140 GeV; for the low mass 2 jets VH category, the dijet mass is required to be between 
40 and 130 GeV. 

The resultant 4-lepton mass distribution is shown in Fig. 9.60 (left), where a clear 
(but statistically limited) peak is observed at about 125 GeV [42]. Note also the pres- 
ence of the 4-lepton decay of the Z boson at 90 GeV (useful for calibration) and the 
large increase of the 4-lepton cross-section once both Z’s can be on mass-shell, above 
200 GeV. The mass distribution for the two dilepton pairs is shown in Fig. 9.60 (right), 
where the dominance of the leading pair to be at the Z-pole mass is evident. The signal 
strengths for the ZZ* channels are shown in Fig. 9.64. 
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9.6.2.3 WW* — lvlv final states 


The WW* = lvlv final state is perhaps the most challenging from the analysis per- 
spective and from the viewpoint of perturbative QCD predictions. There is less infor- 
mation available due to the presence of 2 neutrinos in the final state, and there are 
significant backgrounds which depend on the jet multiplicity, necessitating separate 
analyses depending on the number of jets present. The latter requires the use of exclu- 
sive final states (exactly 0 jets, exactly 1 jet) which have intrinsically larger theoretical 
uncertainties than inclusive final states. 

The most sensitive final state is with eu +0 jet. The dominant background for this 
final state is from WW production. WW backgrounds can be suppressed by exploiting 
the properties of W boson decays and of the (expected) spin 0 nature of the Higgs 
boson. The latter results in the two leptons being produced relatively close to each 
other, resulting in a small dilepton mass (< myiggs/2). The dilepton invariant mass 
is used to select signal events and a signal likelihood fit is performed in two ranges of 
my in ep final states with 0 or 1 jet. The final states are also separated according to 
the value of the transverse momentum of the sub-leading lepton (W* leptons will have 
on average lepton momenta smaller than from on-shell W decays). It is also useful to 
calculate the transverse mass: 


2 


2 
mr = (Bt tpt’) —|prt+ py |’, (9.10) 
where El = 4/ (pit)? + (my)2, p%” (pi) is the vector sum of the neutrino (lepton) 


transverse momenta. 

The distribution has a kinematic upper bound at the Higgs boson mass, effectively 
separating Higgs boson production from the WW and top quark backgrounds. The 
transverse mass distributions are shown for ATLAS for two different analysis categories 
in Fig. 9.61 [45]. The dominant background is WW production for low jet multiplicities 
and tt for the higher jet multiplicity. 

The exclusive nature of the 0 and 1 jet bins (basically the large restriction in 
phase space that leads to the creation of large logarithms that need to be resummed) 
adds to the uncertainty for the determination of the Higgs boson cross-sections in this 
decay channel and to the extraction of the Higgs couplings. A great deal of theoretical 
effort has been devoted towards understanding those increased uncertainties, and in 
trying to reduce them through the use of resummation techniques. The uncertainty 
on the jet multiplicity distributions is calculated using the jet-veto-efficiency (JVE) 
method [199] for the gluon-gluon fusion categories and with the Stewart-Tackmann 
method (ST) [865] for the VBF category. 

The signal strengths for the WW* channels are shown in Fig. 9.64. 


9.6.2.4 Associated production 


As seen in Fig. 4.55, the largest branching ratio for a 125 GeV Higgs boson is into a bb 
final state. Production through gg fusion is not measurable due to the overwhelming 
background from QCD bb production. However, measurement of the bb final state is 
possible when the Higgs boson is produced in association with a vector boson. As 
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Fig. 9.61 The transverse mass distributions measured in ATLAS in the 8 
TeV data in the electron-muon channel, for the gluon-gluon fusion enriched 
region. The transverse mass distribution on the left is for the 0 jet bin, while 
the transverse mass distribution on the right corresponds to the > 2 jet 
bin. The Higgs boson signal is given by the uppermost filled portion of the 
histograms. Reprinted with permission from Ref. [45]. 


discussed in Section 8.7, this was the primary search channel for the Higgs boson 
at the TEVATRON. The measurement of the associated production of a vector boson 
(W/Z) with a Higgs boson decaying into a bb pair allows for the direct measurement 
of the coupling of the Higgs boson to b-quarks. There is still a significant background 
from Vbb production, itself not perfectly understood, as discussed in Section 9.3.5. 

In the Higgs boson analysis in this channel, the events are first categorized ac- 
cording to the number of leptons (0,1 and 2), jets (2 or 3 with traverse momenta 
great than 20 GeV and (absolute) rapidity less than 2.5 (inside the b-tagging range) 
and b-tagged jets. Events are rejected if any additional jets with transverse momenta 
great than 30 GeV are found with rapidity greater than 2.5 (in order to reduce tt 
backgrounds). Dedicated boosted decision trees are then constructed for each channel, 
with the boosted decision trees trained to separate the associated production signal 
from the backgrounds. The weighted event distribution is shown in Fig. 9.62. The 
signal strengths for the VH(— bb) channels are shown in Fig. 9.64 [48]. There is a 
significance of 1.4 with an expected significance of 2.6. 


9.6.2.5 77 final states 


The ATLAS analysis channels require either the presence of 2 isolated opposite-sign 
leptons above the transverse momentum threshold, exactly one isolated lepton and 
one hadronic candidate with opposite sign charges, above threshold, or two hadronic 
candidates above threshold. The events are divided into two categories: VBF Higgs 
boson production, with a requirement of two high transverse momentum jets separated 
in rapidity, or boosted, requiring a transverse momentum for the Higgs boson candidate 
of above 100 GeV. The signal-to-background ratio is improved by going to the boosted 
regime. The weighted mass distribution is shown for the Higgs boson search into the 
TT final state in Fig. 9.63 (left), while the signal strength is shown in Fig. 9.63 (right). 
The observed significance is 4.5 with an expectation of 3.4. 
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Fig. 9.62 The distribution of mz, after the subtraction of all backgrounds 
(except for diboson) in ATLAS in the 8 TeV data. The contributions have 
been summed weighted by their values of expected Higgs signal to back- 
ground. Reprinted with permission from Ref. [48]. 


9.6.3 Signal strengths 


The measured signal strengths for the Higgs boson analysis channels for the individual 
ATLAS and CMS experiments in Run I are reviewed in references ATLAS [55] and 
CMS [669]. A joint determination by the two experiments of the signal strengths for 
the various Higgs boson production and decay channels is shown in Fig. 9.64 [54]. 
All signal strengths are consistent with the SM predictions within the relatively large 
Run 1 uncertainties. In most cases, the precision of the joint results improves by the 
expected factor of 1/,/2 from those of the individual experiments. The combined signal 
yield (with respect to the SM prediction) is 1.0740.07(stat)+0.08(syst). The dominant 
systematic uncertainty is due to the theoretical uncertainties for the inclusive cross 
section predictions. 


9.6.4 Higgs boson mass and width 


Precision measurements of the photon and lepton 4-vectors allow for the best de- 
termination of the Higgs boson mass in the diphoton and 4-lepton final states. The 
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Fig. 9.63 (left) The reconstructed weighted m+, distribution, where the 
weights are determined by the signal-to-background predictions for the 
different final states. The data are from ATLAS in the 7 and 8 TeV data 
samples. (right) The fitted values of the Higgs boson signal strength for 
the different 77 final states, and their combination, from the combined 7 
and 8 TeV data. Reprinted with permission from Ref. [38]. 


measurements by ATLAS and CMS for these 2 final states are shown in Fig. 9.65 [36]. It 
would be potentially interesting if the 4-lepton and diphoton final states had different 
masses, but the hierarchy between the 2 states is opposite for ATLAS and CMS, pointing 
to statistical fluctuations as the root cause. The measurements are all consistent with 
each other, allowing a combined determination of the Higgs boson mass from Run 1 of 
MHiggs = 125.09 + 0.21(stat) £0.11(syst). Note that the dominant error is statistical; 
both the statistical and systematic errors should significantly improve in Run 2 of the 
Luc. This particular value for the Higgs boson mass has interesting implications for 
the (meta)stability of the vacuum. 

As mentioned previously, the width of a 125 GeV Higgs boson is too small to 
be measurable, on the order of 4 MeV. However, as discussed in Section 4.8, the 
high mass ZZ* region is sensitive to Higgs boson production through off-shell and 
background interference effects. Amazingly enough, approximately 15% of Higgs boson 
production in the ZZ* channel occurs above the ZZ threshold. The cross-section for 
H — ZZ* is comparable to the cross-section for continuum production of gg > ZZ 
(with which it destructively interferes) above this threshold. The dominant sub-process 
for high mass ZZ final states is through qq > ZZ. The leading order cross-section for 
gg — ZZ is through a box diagram as shown in Fig. 4.57. At the time that the 8 TeV 
analyses were carried out, the NLO (2-loop) calculation for this process was beyond 
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Fig. 9.64 The joint ATLAS +CMS Higgs boson signal strengths, by final 
state and production mode. Reprinted with permission from Ref. [54]. 
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Fig. 9.65 The measured ATLAS and CMs Higgs boson masses separated 
by final state. Reprinted with permission from Ref. [36]. 
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the current technology. Given the progress in calculating two-loop integrals with two 
massless and two massive external lines, this calculation has now been carried out. 
The resulting QCD corrections increase the gg > ZZ cross-section by the order of 
50-100%, depending on the exact scale choice [323]. 

The ATLAS analysis varies the possible K-factors for this process as part of the 
systematic uncertainties (CMS assumes the same K-factors for both resonant and non- 
resonant gg —> ZZ production). The ratio of the off-shell to on-shell signal strengths is 
directly proportional to the Higgs width. The 4-lepton mass distribution in the ATLAS 
Higgs search is shown in Fig. 9.66 in the mass range from 220-1000 GeV, along with 
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Fig. 9.66 The measured ATLAS 4-lepton mass distribution. Reprinted 
with permission from Ref. [37]. 


the contributions from the Standard Model, including that of the Higgs boson [37]. 
The dashed line indicates the impact of an off-shell coupling 10 times the SM value. 
Assuming that the relevant Higgs boson couplings are independent of the energy scale 
of the Higgs production, the combination of the ZZ and WW results yields 95% 
confidence level upper limits for r/r% in the range between 4.5-7.5 (with ATLAS 
and CMS having similar results). 


9.6.5 Higgs differential distributions 


More information on the properties of the Higgs boson can be gained by studies of dif- 
ferential distributions of the Higgs boson itself and of accompanying jets. This is easiest 
done in two decay modes: the diphoton mode, because of its relatively large number of 
signal events, and the 4-lepton mode, because of its large signal to background ratio. 
The diphoton final state is described first. 

Whether jet measurements can be conducted with the diphoton final states can be 
easily determined from the diphoton mass distributions for different jet multiplicities 
observed in Fig. 9.67 and Fig. 9.68 [35]. The diphoton mass bump becomes more 
prominent as the number of jets increases. This implies that events with a Higgs 
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Fig. 9.67 The diphoton mass distributions measured in ATLAS in the 8 
TeV data for the 0 jet channel (left) and the 1 jet channel (right). Reprinted 
with permission from Ref. [35]. 
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Fig. 9.68 The diphoton mass distributions measured in ATLAS in the 
8 TeV data for the 2 jet channel (left) and the > 3 jet channel (right). 
Reprinted with permission from Ref. [35]. 


boson are jettier than events produced from the QCD continuum background. This 
is expected as the bulk of Higgs boson production occurs through gg fusion into a 
colour-singlet (Higgs boson) final state, a situation that leads to a large probability 
for the production of additional jets, as discussed in Section 4.8.4. 

The number of events is much more limited for the case of the 4-lepton final 
state, but it is still useful to combine the two measurements within a common fiducial 
volume, especially as they are consistent with each other. The Higgs boson transverse 
momentum distribution for the diphoton, the 4-lepton and the combined final states 
is shown in Fig. 9.69 [43]. The two final states produce similar results. The Higgs 
boson transverse momentum distribution in the data is observed as somewhat shifted 
towards higher pr compared to the theoretical predictions. 

The jet multiplicity distributions for the combined diphoton and 4-lepton final 
states are shown in Fig. 9.70 for the inclusive (left) and exclusive (right) cases [43]. 
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Fig. 9.69 The ATLAS Higgs boson transverse momentum distribution, 
combining the diphoton and ZZ* channels. Reprinted with permission 
from Ref. [43]. 


The measured cross-sections show somewhat jettier final states than predicted by the 
various theory predictions for the gg fusion process shown.!* Also shown in the figures 
(and added to the gg fusion results) are the contributions from VBF, VH, ttH and 
bbH production. These contributions form a more significant fraction of the total 
production as the jet multiplicity increases. 

The transverse momentum distribution for the lead jet is shown in Fig. 9.71 (left), 
compared to several theoretical predictions [43]. The Ay distribution between the two 
leading jets is shown in Fig. 9.71 (right) [35]. Due to limited 4-lepton statistics for this 
observable, the data for this plot is only from the diphoton final state. The excess of 
data over theory occurs more for small jet rapidity separations. Note that larger jet 
separations are dominated by VBF production. 

A useful summary of the diphoton channel differential cross-sections, compared to a 
number of theoretical predictions, is shown in Fig. 9.72, from Ref. [35]. For reference, 


13The differential distributions reported by the CMS collaboration in [671] are closer to the SM 
predictions. 
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Fig. 9.70 The ATLAS Higgs boson jet multiplicity distribution, combining 
the diphoton and ZZ* channels. Reprinted with permission from Ref. [43]. 


the NNLO Higgs+> 1 jet fiducial cross-section (not shown on the plot) is 10 fb~? 
(including top quark mass effects) [325]. 


9.6.6 Short aside/rant 


The highest precision for inclusive jet observables associated with the production of a 
Higgs boson, or for any other suitable final state, is typically achieved with fixed or- 
der predictions. This is especially true given the number of such final states for which 
NNLO predictions are available. Resummed/parton shower calculations provide a bet- 
ter description for non-inclusive observables where Sudakov effects are important, such 
as the transverse momentum distribution of the Higgs boson. It is notable that, unlike 
comparisons shown for the W/Z + jets finals states earlier in this chapter, there are 
no similar comparisons to fixed order predictions for Higgs boson + jets at the LHC. 
The 2015 Les Houches Standard Model working group report [161] provided a detailed 
comparison of a wide variety of predictions for both inclusive and exclusive final states 
for Higgs boson (+ jets) production at the LHC. In order to control extraneous differ- 
ences, the same PDF was used for all predictions, as well as the same scale (to the 
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Fig. 9.71 (left) The inclusive lead jet transverse momentum distribution 
measured in ATLAS in the 8 TeV data from Ref. [43]. The dijet rapidity 
separation between the two leading jets for Higgs + > 2 jets. Reprinted 
with permission from Ref. [35]. 
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Fig. 9.72 A compilation of the Higgs differential fiducial cross-section 
measurements in the diphoton channel, compared to a number of theoret- 
ical predictions. Reprinted with permission from Ref. [35]. 
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Fig. 9.73 The angles in the H > ZZ* — 4l decay channel useful for 
spin-parity determination of the the Higgs boson. Reprinted with permis- 
sion from Ref. [659]. 


degree for which this was possible). Good agreement was observed between fixed or- 
der predictions and predictions involving parton showering/resummation for inclusive 
observables, such as the lead jet pr distribution for H+ > 1 jet. That is, contrary 
to some conventional wisdom, parton showers and/or resummation do not affect fixed 
order results, for suitably inclusive observables. 


9.6.7 Spin-parity 


As discussed in Chapter 4, the Higgs boson is predicted to be a pure scalar particle 
(JP = 0+). With the addition of beyond Standard Model physics, resulting in different 
allowed interactions in the Lagrangian, the Higgs boson(s) may have a different spin 
or CP state, or even states of mixed CP. The different CP/spin structures may alter 
the kinematic distributions, especially angular distributions, of the Higgs boson decay 
particles. The most useful modes are those involving H > ZZ* — 4l, H — yy, and 
H => WW* — lvlv final states. See for example the angular observables sensitive to 
the spin and parity of the Higgs boson in the H — ZZ* — Al decay channel in Fig. 9.73. 
Both the ATLAS [49] and Cms [659] results strongly disfavour the spin 1 and spin 2 
hypotheses, as well as the 07 state. In studies of potential mixed states, for example 
having the Higgs boson couple via a CP-odd term (thus implying CP violation), 
ATLAS and CMS have found no evidence of non-Standard Model (0*) behaviour. 
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9.6.8 Vector boson scattering 


Vector boson scattering is a key process to fully understand the nature of electroweak 
symmetry breaking. Without the presence of a Higgs boson, the cross section for the 
process would diverge at high diboson masses. Even given the discovery of a Higgs 
boson with a mass of 125 GeV, the process may still not respect unitarity, and a 
detailed study of the process is necessary. 

First evidence for vector boson scattering (VBS) has been found by the ATLAS 
experiment in a measurement of same sign W boson pair production, accompanied by 
two or more jets (W*W+jj), at 8 TeV [25]. The W bosons are required to decay into 
either electrons or muons, with both leptons having the same sign. This suppresses 
the background from normal diboson production. The leptons are required to have 
a transverse momentum greater than 25 GeV, an absolute rapidity less than 2.5, a 
dilepton mass greater than 20 GeV, and a dilepton separation (AR) greater than 0.3. 
The jets are formed using the anti-kr algorithm with radius 0.4 and are required to 
have a transverse momentum greater than 30 GeV, and an absolute rapidity less than 
4.4. Each jet and lepton pair are required to have a separation (AR) greater than 0.3. 
The invariant mass of the two jets with the highest transverse momentum is required 
to be greater than 500 GeV, and the missing transverse energy is required to be greater 
than 40 GeV. 

The VBS process under investigation is of order ahw. The same final state can 
also be produced with a mixed strong-electroweak process of order a%,y,a2. The VBS 
process is similar to that of vector boson fusion production of the Higgs boson, in 
that the initial state W bosons are radiated off of incoming quarks, as shown in 
Figure 9.74(left). Those quarks receive a transverse momentum kick and appear as 
jets. Given their longitudinal boost, the jets in the VBS process tend to have a wider 
separation in rapidity than for the mixed strong-electroweak process. To enhance the 
VBS purity, an additional requirement is made that two jets be separated by a rapidity 
interval of 2.4 or more. 

The measured Ay,; distribution is shown in Figure 9.74(right), along with the 
predicted signal and background components. The |Ay,,;| cut of 2.4 for the VBS region 
is indicated. There is both strong evidence for inclusive production of W*W*jj (4.5 
standard deviations) and of pure electroweak production (3.6 standard deviations). 
The measured fiducial cross sections of 2.1 + 0.5(stat) + 0.3(syst) for the inclusive 
region and 1.3 + 0.4(stat) + 0.2(syst) fb in the VBS region, are consistent with the 
respective Standard Model predictions of 1.52+0.11 fb and 0.95+0.06 fb. More detailed 
investigations will be possible with the higher statistics expected in Run 2. 


9.7 Outlook 
9.7.1 Standard Model physics at the LHC 


Two key aspects of Run II (and beyond) physics at the LHC relate to higher precision 
and extended kinematic reach. No clear signs of beyond Standard Model physics have 
been discovered (to date) at the LHC. Searches for new physics will necessarily require 
precision measurements of SM processes (especially of Higgs boson production) seeking 
deviations that may indicate the presence of new physics. The higher running energy 
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Fig. 9.74 (left) One of the diagrams for VBS. (right) The |A^y;;| distribu- 
tion for events passing the cuts for the inclusive region. The cut in |Ay;;| 
denoting the VBS region is indicated. The W~W~4j prediction has been 
normalized to the Standard Model value. Reprinted with permission from 
Ref. [25]. 


and increased luminosity also result in greater access to the TeV range, in which signs 
of new physics may be more obvious. 

In most cases, increased precision means the calculation of a process to NNLO in 
QCD, and often NLO in EW. There has been great progress in the calculation of LHC 
processes to NNLO and beyond, as detailed both in this book and in workshops such as 
Les Houches [161]. The technology for 2 + 2 processes at NNLO is already relatively 
mature, and the reasonably near future should see NNLO being extended to 2 > 3 
processes. Higgs boson production through gg fusion has already been calculated at 
NLO and the next obvious extension is to carry out the calculation of Drell-Yan 
production to this order. So far, only PDFs at NNLO are available, but for ultimate 
precision a determination of PDFs at NLO may be necessary. Of course, this also 
requires the calculation of the processes in global PDF fits at this order. At this level 
of precision, NLO EW corrections can become equally important as those from NNLO 
(and above) QCD. Above the TeV scale, EW effects are often not subtle, and the 
radiation of W and Z bosons will compete with QCD gluon radiation. In most cases, 
the NNLO QCD and NLO EW calculations may factorize, but in some instances mixed 
corrections may be required, especially if the QCD corrections are large. In general, 
the higher the order of calculation, the smaller the scale dependence will be. However, 
some care will still have to be taken with regards to the choice of a physical scale for 
the process, especially in the presence of a complex final state. 

In the TeV range, photon-initiated processes will become increasingly important, 
for example for high-mass W boson pair production. Most of this book has concen- 
trated on fixed-order calculations, but the TeV range also means that high-z effects will 
become important and threshold resummation corrections will be crucial to calculate. 

From the experimental side,larger data samples by definition mean smaller statis- 
tical errors, and often smaller systematic errors due to an improved knowledge of the 
experimental measurement. A high integrated luminosity necessarily requires a high 
instantaneous luminosity and the presence of many pileup events. This will in many 
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cases degrade the quality of the experimental measurements, and require the increase 
of trigger/analysis thresholds. It will still be crucial, however, to have the ability to 
trigger on standard benchmarks such as W and Z boson production. Even though the 
experimental environment may be daunting, the ATLAS and CMS experiments have 
been designed to operate in those conditions. 


9.7.2 Higgs boson physics at the LHC 


The culmination of Run 1 at the LHC was the discovery of the Higgs boson by ATLAS 
and CMS. A precision determination of its properties, though, requires the higher 
energy and integrated luminosity of Run 2 (and beyond). Data samples of 300 fb~! 
are projected for each experiment in Run 2 at a centre-of-mass energy of 13-14 TeV. 
Through the full running of the LHC, an integrated luminosity of 3000 fb~! can be 
expected. Cross-sections for Higgs boson production subprocesses are a factor of 2-4 
times higher than at 8 TeV, as seen in Fig. 4.53. In this section, we briefly describe 
the experimental improvements in precision expected for the full Run 2 (and beyond) 
data sample, and the needed theory improvements to best match those experimental 
improvements. The discussion roughly follows that in Refs. [161, 439]. 

The largest production cross-section at 13 TeV remains that of the gg fusion sub- 
process. As discussed in Section 9.6, the current experimental uncertainties for this 
subprocess are on the order of 20-40%. Theoretically, the uncertainty was formerly on 
the order of 15%, with the scale and PDF+a,(mz) uncertainties both having roughly 
equal values. The calculation of the gg Higgs boson production to NNNLO has reduced 
the scale uncertainty to 2-3%, while the recent PDF4LHC combination has resulted in 
a PDF uncertainty of the same order. The recommendation for the as(mz) variation 
from the PDF4LHC combination results in a similar level of uncertainty on the gg 
fusion cross-section. 

The experimental uncertainty is expected to decrease to less than 10% (4%) in the 
300 fb~+ (3000 fb~') data sample. This may require an improvement of the theoretical 
accuracy for the production cross-section, with a knowledge of the combined NNLO 
QCD+EW contributions retaining the top quark mass effects [161]. 

With a data sample of 300 fb~', a very rich program of measurements of Higgs 
boson + jets final states is possible. There is a comparable (or larger) increase in 
the Higgs boson + jet cross-section as observed for inclusive Higgs production. As the 
production proceeds primarily through a top quark loop, it is important to probe inside 
that loop to understand the dynamics of production, and in particular to determine if 
any BSM particles may contribute to the loop. Each experiment will have on the order 
of 3000 events (in the diphoton channel) with a jet with transverse momentum above 
the top quark mass. With 3000 fb~', the reach in jet transverse momentum is over 
700 GeV. At jet transverse momenta of this order, there is a very large suppression 
(over a factor of 5) of the Higgs+jet cross-section due to finite top-mass effects (over 
the effective theory). Even without the presence of new physics, there may be new 
dynamics present at these scales. To properly understand the physics of Higgs boson 
+ jet production, it is necessary to calculate the finite top quark mass effects at NLO 
QCD+NLO EW. 
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Higgs boson final states with at least 2 jets are crucial to understand Higgs boson 
couplings, especially the coupling to vector bosons through the vector boson fusion 
process. With a 300 fb~! (3000 fbt) data sample, this coupling can be determined 
to the order of 5% (2-3%). On the theoretical side, this may require the calculation 
of both vector boson fusion and gluon gluon fusion production of the Higgs boson + 
2 or more jets to be known to NNLO QCD and for the finite top mass effects to be 
known to NLO QCD+NLO EW. 

Higgs boson couplings to b-quarks are known primarily through associated pro- 
duction (V H). Currently, this coupling is known to the order of 50%. With 300 fb? 
(3000 fb~'), this can be improved to 10-15% (7-10%). On the theoretical side, one 
bottleneck has been the knowledge of the gg — HZ process, currently known only to 
LO. This process has a sizable contribution to the total rate, and contributes signifi- 
cantly to the total theoretical uncertainty. It is desirable to combine Higgs production 
and decay to the same order, NNLO in QCD and NLO in EW. 

With 300 fb~' (3000 fb~'), the top quark Yukawa coupling should be measured 
to the order of 15% (5-10%) (through the tH and ttH subprocesses). Current ¢(¢H is 
only known to LO in QCD and ttH is known to NLO. For a full understanding of this 
coupling, it is desireable to know both cross-sections (with top quark decays) to NLO 
QCD including NLO EW effects. 

Fiducial cross-sections have been measured for several of the Higgs boson (+jets) 
channels in Run I. With the higher statistics of Run II, this will happen for more final 
states. For most channels in Run I, however, the end result has been measured signal 
strengths and multiplicative coupling modifiers. In Run II there will be a transition 
to simplified template cross-sections (STCS), as discussed in [161]. The primary goals 
of the STCS method are to maximize the sensitivity while minimizing the theory 
dependence of the measurement. This entails: the combination of decay channels, the 
measurement of cross-sections rather than signal strengths, and the determination of 
cross-sections for specific production modes. The physics interpretation (and model- 
dependence) is left for the final stage of the analysis. 


10 
Summary 


10.1 Successes and failures at the LHC 


Perhaps the greatest success at the LHC (besides the discovery of the Higgs boson) 
is the non-discovery of new physics. This statement may seem counter-intuitive. Of 
course, the discovery of new physics would have been desirable, but the experimental 
analysis techniques and the comparisons to theoretical predictions have worked well 
enough that Standard Model physics has not been confused with BSM physics.! An- 
other seemingly counter-intuitive statement is that the LHC benefitted by turning on 
at a lower energy, with a reduced luminosity, in 2010. The lower energy and smaller 
data sample precluded most beyond-the-Standard Model searches. This forced more 
physicists to work on SM physics measurements (leading to the re-discovery of the 
Standard Model), thus forming benchmarks and tools that were useful with higher 
luminosity samples where discovery potential was present. 


The resolution of the LHC detectors, both calorimetry and tracking, is superior to 
that of the CDF and D@ detectors. Tracking, in particular, has higher precision and 
extends to a higher rapidity than possible at the TEVATRON. The improvement in com- 
puting power has meant that detailed event simulations, tracing the electromagnetic 
and hadronic showers, are possible for a variety of physics processes, allowing a better 
understanding of the detector response. 

The theoretical tools and analysis techniques available to LHC physicists are for the 
most part more sophisticated than those available at the TEVATRON. Fixed-order pre- 
dictions at NLO (interfaced to parton shower programs) are available for basically any 
reasonable process, and NNLO calculations for 2 — 2 processes have reached a degree 
of maturity, with calculations of 2 — 3 processes to be expected. The gg —> H process 
has been calculated to NNNLO and similar calculations for Drell-Yan production are 
not far off. The higher order calculations have resulted in smaller theoretical uncer- 
tainties from scale variations. Since it is possible that new physics may not show up 
as a clear peak on a distribution, but rather in subtle variations from SM predictions, 
precision comparisons are crucial for discovery /exclusion of BSM physics. 

In the precision physics region (50-500 GeV), PDF uncertainties are small for most 


1The authors hold out hope that new physics will indeed be discovered at the LHC. 
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parton-parton luminosities, but are relatively unconstrained at high mass, especially 
for initial states involving gluons. Further reduction in PDF uncertainties, especially in 
the high mass region, can come only from data at the LHC. However, in order to provide 
constraining information on PDFs at high x, the data must be consistent: different 
distributions in the same measurement that provide overlapping PDF information 
(for example y; and mz) must be consistent, results from different measurements in 
the same experiment must be consistent (for example inclusive jet production and tt 
production both provide information on the high x gluon), and results from the LHC 
experiments must be consistent. Otherwise, the measurements may change the central 
PDFs, but the tension will result in the uncertainty not changing (or even growing 
larger). 

Theoretical predictions are most powerful when they relate to fiducial cross-sections; 
extrapolating to the full phase space most often introduces an extra layer of uncer- 
tainty, as witnessed for example in the ATLAS measurement of the WW cross-section, 
discussed in Section 9.4.2. Fiducial measurements are more common at the LHC than 
at the TEVATRON, and hopefully the trend towards more fiducial measurements will 
continue. This requires that the theoretical calculations also provide predictions at the 
fiducial level, incorporating for example decays for all unstable particles. 

There are still issues with predictions at the very highest masses; in addition to the 
larger PDF uncertainties in this region, projections of cross-sections/backgrounds are 
often made using parton shower Monte Carlo programs, where parameter variations 
in the Monte Carlo can lead to sizeable uncertainties. In some cases, this increased 
uncertainty is not warranted, especially if the parameter variations can be constrained 
by (higher precision) fixed-order calculations. 

In order to reach the sensitivity needed for new physics searches, the LHC must be 
run at as high a luminosity as possible. This necessarily results in a large number of 
additional interactions in each bunch crossing (pileup), creating problems with particle 
identification and with precision measurements of the particle/jet energies. Techniques 
have been developed for dealing with pileup, in particular the jet area subtraction 
technique discussed in Section 9.2.1. Topology dependences of the pileup energy density 
can limit the ultimate efficacy of the subtraction method. 

By necessity, the jet area subtraction technique removes not only the pileup energy, 
but also the energy associated with the underlying event. Previous measurements at 
the TEVATRON and LHC have included the underlying event in the physical observ- 
ables.” Since the underlying event information has been removed by the subtraction 
technique, the choice of the LHC experiments has been to add it back in by including 
a Monte Carlo prediction for that energy. In some sense, this, although necessary, is a 
step backwards from the trend towards removing as much Monte Carlo extrapolation 
on an observable as possible. 

Tracking at the LHC is better than that at the TEVATRON, and in particular it is 
most often possible to distinguish the interaction vertex for the interaction of interest 
from those of pileup events. Thus, one can distinguish hard scatter jets from pileup 


2A prediction for the underlying event is present in every parton shower Monte Carlo program, 
but not in fixed-order calculations. For these, non-perturbative corrections must be calculated by the 
experimenters to allow comparison of parton-level predictions to hadron level observables. 
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jets using the jet tracking information, and reject jets if too much of the jet energy 
arises from pileup contributions. Alas, this is possible only for jets produced in the 
precision tracking region (|y| < 2.5) and pileup jets are much more of a problem at 
more forward rapidities. Unfortunately, this is a region where jet identification can be 
crucial, as for example in measuring the tagging jets in VBF Higgs production. The 
problem will only get worse as the instantaneous luminosity increases. The solution is 
to provide more information to discriminate between pileup and hard scatter jets (such 
as timing for the forward calorimetry), or to simply raise the jet transverse momentum 
cutoff for forward jets. 


10.2 Lessons for future colliders 
10.2.1 Standard Model cross-sections beyond 14 TeV 


To understand the physics potential of future proton-proton colliders, it is imperative 
to understand the centre-of-mass energy dependence of notable cross-sections at such 
machines. Fig. 10.1 shows the predicted cross-sections for a selection of basic processes, 
ranging over twelve orders of magnitude from the total inelastic proton-proton cross- 
section to Higgs boson pair-production. For inclusive jet and direct photon production, 
50 GeV transverse momentum cuts are applied to the jet and the photon respectively. 

The growth of the cross-sections with ,/s largely reflects the behaviour of the un- 
derlying partonic luminosities, cf. Section 6.5. For instance, the top pair cross-section 
is dominated by the partonic process gg —> tt and the gluon-gluon luminosity rises 
significantly at higher values of ys. The same holds true for the Higgs production 
channel ttH but, in contrast, the associated production channels are dominated by 
quark-antiquark contributions and rise much more slowly. The different behaviour 
means that, unlike at current LHC operating energies, the ttH channel becomes the 
third-largest Higgs production cross-section at 33 TeV and above. As a figure of merit 
for estimating the difficulty of observing the Higgs pair production process it is not 
unreasonable to consider the ratio of its cross-section to the top pair cross-section. In 
many of the possible Higgs boson decays the final states receive significant background 
contributions from the top pair process. The fact that both processes are predomi- 
nantly gluon-gluon induced means that this measure is approximately constant across 
the range of energies considered. From a consideration of total cross-sections alone, 
it is therefore not clear that the prospects for extracting essential information from 
the Higgs-pair process are significantly better at a higher-energy hadron-collider, even 
though the rates increase dramatically. 

A different sort of contribution to event rates can also be estimated from this 
figure. The contribution of double parton scattering events, of the type discussed in 
Section 7.2.3, can be crudely estimated from Eq. (7.43). The value of oo can be 
considered to be approximately energy-independent and around 20mb. Although this 
is not exactly true, the uncertainty on this parameter, and indeed on the accuracy 
of Eq. (7.43) itself, is such that this should be considered sufficient for an order-of- 
magnitude estimate only. A particularly simple application of this is the estimation 
of the fraction of events for a given final state in which there is an additional DPS 
contribution containing a pair of b-quarks. This fraction is clearly given by the ratio, 
01,/(20 mb). From the figure this fraction ranges from a manageably-small 2% effect 
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Fig. 10.1 Cross-sections for select hadron collider processes as a function 
of the operating energy, ys. The cross sections presented in this figure 


have been calculated at next-to-leading order in QCD using the MCFM 
program [311, 314]. 
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Fig. 10.2 Cross-sections for the production of a Higgs boson in associa- 
tion with n or more jets, for n = 0,1,2, normalized to the inclusive Higgs 
cross-section (n = 0). Cross-sections are shown as a function of the min- 
imum jet pr and are displayed for a proton-proton collider operating at 
14 TeV (left) and 100 TeV (right). 


at 8 TeV to a much more significant 15% at 100 TeV. More study would clearly be 
required in order to obtain a true estimate of the impact of such events on the physics 
that could be studied at higher energies, but these simplified arguments can at least 
give some idea of the potentially troublesome issues. 

As an example of the behaviour of less-inclusive cross-sections at higher energies, 
Fig. 10.2 shows predictions for H + n jets + X cross-sections at various values of /s 
and as a function of the minimum jet transverse momentum. The cross-sections are all 
normalized to the inclusive Higgs production cross-section, so that the plots indicate 
the fraction of Higgs events that contain at least the given number of jets. The inclusive 
Higgs cross-section includes NNLO QCD corrections, while the 1- and 2-jet rates are 
computed at NLO in QCD. All are computed in the effective theory with m — oo. 

The extent to which additional jets are expected in Higgs events is strongly depen- 
dent on how the jet cuts must scale with the machine operating energy. For instance, 
consider a jet cut of 40 GeV at 14 TeV, a value in line with current analysis projections. 
For this cut, approximately 20% of all Higgs boson events produced through gluon fu- 
sion should contain at least one jet. The fraction with two or more jets is expected to 
be around 5%. To retain approximately the same jet compositions at 100 TeV requires 
only a modest increase in the jet cut to 80 GeV. 

However, this analysis is not the full story, due to effects induced by a finite top- 
mass that are neglected in the effective theory. This is illustrated in Fig. 10.3, which 
shows the rates for Higgs production in association with up to three jets, taking proper 
account of the top-mass, as a function of the minimum jet pr. As shown in the lower 
panel, a comparison of these results with those obtained in the effective theory reveals 
significant differences. Even for moderate jet cuts of around 50 GeV a finite top-mass 
results in differences in the H + 3 jet rate of approximately 30%. For significantly 
harder jet cuts the effective theory description clearly fails spectacularly. Although 
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Fig. 10.3 Cross-sections for the production of a Higgs boson in association 
with 1, 2 or 3 jets, taking into account finite top-mass effects. Cross-sections 
are shown as a function of the minimum jet pr for a proton-proton collider 
operating at 100 TeV. The lower panel shows the ratio of these results to 
the ones obtained in the effective theory. Reprinted with permission from 
Ref. [267]. 


this should not be a great surprise, given the energy scales being accessed, it is a 
useful reminder of the limitations of approximations that are commonly used at the 
LHC. Such approximations must clearly be left behind in order to obtain meaningful 
predictions for relatively common kinematic configurations at a 100 TeV collider. 

Of course, the differences that exist between the theoretical predictions at 14 TeV 
and 100 TeV offer significant opportunities that are only beginning to be explored. The 
event rates will be sufficiently high that analysis cuts can be devised to take advantage 
of the unique kinematics at a 100 TeV collider, rather than simply “scaling up” the 
types of analyses currently in use at the LHC. For instance, substantially harder cuts 
on the transverse momenta of jets will lead to a predominance of boosted topologies, 
which can be analysed with the types of jet substructure techniques that are still 
relatively new at the LHC, cf. Section 9.2.4. 


10.2.2 Necessary theory developments 


Improvements to the theoretical description of hadronic collisions are of course driven 
by the accuracy of the experimental measurements that can be made. The outstanding 
level of detail that the LHC detectors have been able to provide, from particle iden- 
tification to jet tracking, has enabled experimental uncertainties to be controlled at 
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the few-percent level for quantities such as the transverse momentum of single pho- 
ton or Z-bosons. Such exquisite measurements have thrown down the gauntlet to the 
theoretical community. 

Of course, some of these challenges have been foreseen. Going from the early days 
of the TEVATRON to the build-up to the LHC saw a sea-change in the quality of pertur- 
bative predictions. Rather than being limited to LO predictions for 2 — 2 processes, by 
the advent of the LHC NLO predictions were available for almost all final states of im- 
mediate interest. At the beginning of Run II of the LHC, even NNLO calculations have 
matured to the level of providing differential predictions for events containing jets. The 
pace of these developments has been so fast that it is easy to take for granted a level 
of sophistication that many never believed would have been achieved by now. The 
availability of NLO predictions for Higgs production, multiple examples of NNLO 
calculations matched to a parton shower, and the ability to go from a Lagrangian to 
NLO-accurate showered events, are just a few such examples. Such progress, to a level 
of precision that in some cases borders the ridiculous, may leave the reader wondering 
if challenges remain. Yet, undeniably, much work lies ahead. 

In terms of fixed-order descriptions, the march to higher orders is not yet over. 
It is not clear whether existing techniques for performing NNLO calculations will be 
able to be applied to more complex final states. While continued improvements in 
computer processing power will certainly help, it is almost certain that alternative, 
superior approaches have yet to be devised. Similar arguments apply to the case of 
N°LO predictions, where extensions to the method that could provide more differential 
information, or perhaps be suitable for more general processes, are far from obvious. 
As highlighted in earlier chapters, the presence of substantial electroweak corrections 
at high energies is just beginning to be probed. As the LHC becomes more sensitive 
to even higher energies, the inclusion of higher-order electroweak effects will become 
mandatory in order to retain theoretical predictions of sufficient precision. A simulta- 
neous expansion in both parameters, i.e. correctly including corrections that contain 
a mix of strong and electroweak couplings, will also become important. At present 
no complete calculation of such effects exists, even for a single process. In addition, a 
number of approximations are routinely used to simplify existing calculations. Exam- 
ples include neglecting quark masses, working in the limit m, — oo, and considering 
production and decay stages of resonance production separately. These will all need 
to be revisited, for various physics processes, in the coming years. 

As improved fixed-order predictions become available it will be important that 
their effects are included in parton shower predictions. This will enable the improved 
modelling of the computed processes to be properly taken into account across a wide 
range of experimental analyses. The parton showers themselves will be the subject of 
greater scrutiny as they are held up to the light of experimental data that is ever more 
precise. This may reveal deficiencies in our modelling, either related to an incomplete 
treatment of towers of logarithms, or simply from an unavoidable choice in how the 
shower is constructed. Further subtleties, related to non-perturbative effects such as 
hadronization, fragmentation, and even the quality of the factorization picture itself, 
will eventually require new theoretical understanding as they become the dominant 
sources of theoretical uncertainty. 
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Finally — and, perhaps, most critically — it is important to not lose sight of the fact 
that the ultimate goal of this program is to extract the most possible information from 
the data that the LHC provides. To this end it is imperative to also continually develop 
new tools and novel approaches for doing just that. An excellent example of this is 
the development of jet substructure techniques, that have already had applications 
to top-tagging, jet discrimination, and a host of other analysis methods besides. No 
doubt there are many more insightful theoretical observations of this nature waiting 
to be made in the years ahead. 


Appendix A 
Mathematical background 


A.1 Special functions 


This section outlines the definitions of the special functions that appear in this book, 
together with some simple properties that are germane to the discussion here. 


A.1.1 Gamma function 


The Gamma function is defined by the integral, 


Tr(a) = ni dx rte” (A.1) 


and satisfies the relation, 
r(a+1)=aT(a). (A.2) 
If a is a positive integer then the Gamma function reduces to a factorial, 


Tr(a) = (a — 1)! (A.3) 


In calculations performed in dimensional regularization a useful representation of 
the Gamma function is, 


r(1+£)= exp ( YEE + E =) + O(e?), (A.4) 


where yg ~% 0.57221 is the Euler-Mascheroni constant. Using this representation it is 
easy to see that the following relation holds, 
T?(1—e) ern 


ma-z) = 1 a O(e?). (A.5) 


The beta function B(a, 8) is defined by the integral, 


Tr(a) (8) 


B(a, B) -f dx x°! (1 — x)f7! = Tla+ 8) 


(A.6) 


and, as shown, can immediately be re-expressed in terms of Gamma functions. 
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A.1.2 “+” functions 


The notation of “+” functions appears, for example, in the definition of the splitting 
kernels, cf. Eq. (2.10). The quantity [g(x)], is defined through its integral together 
with a test function f(a) such that 


[ da f(x)g(a) = ie dz (F) = f()) 9(@) + f(1) if da g(x) 
= f: dg f(x) (wo, +ô — f aya(v)) (A.7) 
In other words, 


if da f(a) [g(@)]+ =| dz [f(x) — FA) g(@). (A.8) 
0 0 


A.1.3 Dilogarithm 
The dilogarithm (or Spence’s) function Liz(x) is defined by, 


Liy(x) = i i ae v) (A.9) 


and has the series expansion, 


Useful values for particular arguments are, 


Liz(0) = 0, (A.11) 
Lig(1) = T (A.12) 


There are also a number of identities that combine values of the function for related 
arguments such as, 


2 
Liz(x) + Lig(1 — 2) = 5 — log z log(1 — 2). (A.13) 


A.1.4 Mellin transforms 
The Mellin transform My [f(x)] of a function f(x) is given by! 


lIn mathematical sciences, tpyically the integration limits are 0 and œo, in which case the back- 
transformation reads ; 
co 
1 
f(z) = — ANN! F(N). (A.14) 
271 
—1t00 
However, in what follows the back-transformation is often performed by merely identifiying known 
expressions for Mellin transforms. 
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My [f(x)] = J dea" fo) (A.15) 
0 


There are two reasons why the technology of this transform is interesting. First 
of all, it can be seen very quickly that convolutions of the type encountered in the 
cross-section calculation involving PDFs factorize to trivial products of the Mellin 
transforms. In order to see how this works, consider the convolution of f & ô which 
often occurs in the calculation of cross-sections, 


a = | da(f @G)(a y= fe f 250 )G(a/y) 
3 (A.16) 
= | drdydzð(e — y2) Fae) 
0 
In Mellin space then 
My ((f @6)(e)] = f dex’ (fo)e) = | drdydza™ (xyz) fly) êle) 
0 0 


= J ovoz (yz)" f(y) é(z) = Mn [f(2)]- Mn [6(2)]. 
(A.17) 


Consider, as a useful example, the PDFs transformed to Mellin space, My [fia(a, “r)], 
which, in parallel to the original PDFs, fulfil the DGLAP equation 


Tioga Mn [fiale w] = N, as?) My [fale] (A18) 
with the solution 
o 2 
n [fiya(z, #)] = exp -f “FUN, as(q@))| My [fial Q). (A.19) 
4 


Here, the y(N, as(u?)) are the anomalous dimensions and depend on the strong cou- 
pling. Similar to all other quantities encountered so far they can be expanded as a 
power series in Qs as 


(N, asu a5 (a5 N YON). (A.20) 


i=l 
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Concentrating on the first order term only, y (N), and by direct comparison with 
the DGLAP equation, cf. Eq. (2.31), it is clear that the moment related to the q > qg 
splitting is given by 


1 1 
1+ 2? 
O(N) = My [PPE] = / drz PO (z) = Cp J dea" ( ) 
+ 
0 


a finite number. 

Furthermore, logarithms can be directly identified by analysing the analytical struc- 
ture in the Mellin parameter N, by identifiying poles in 1/(N— No). A straightforward 
way to see this is by realising that 


dl 


_ L 
ant Mw [f(e)] = My [logt (2) f(@)] , (A.22) 
which follows directly from the definition of the Mellin transform when realising that 
aN = exp(N log x). In a similar way, 

My [2*f(z)] = Mnyef(c), (A.23) 


and, if f(x) is regular in the limit x > 1, 


My [22] = 2% f(z)|, - NMw-if(c). (A.24) 


For further reference, in Eq. (A.25) Mellin transforms of different relevant functions 
are listed. 


1 
My [1] = ——— 
wll = gypi 
N 
1 1 
mll]: 
| aR (A.25) 
log(1 — a 1 
M | saz] | - {¥en +60) 4 [e+e ) 
—2£ $ 2 
Here, the w function is related to derivatives of the I function through 
_ dlog I(x) 1, _ dyle) _ d? log (x) 
W(x“) = a and w(x) = g S I> (A.26) 


and ¢ denotes, as usual, Riemann’s Ç function with ¢(2) = 77/6 and ¢(3) ~ 1.2021. 
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A.1.5 Feynman parameterization 


The generic identity for combining propagators into a single denominator through the 
use of Feynman parameters is, 


1 P(v) n 
Ap. Aw IT T D (Fa 1) 


A.1.6 D dimensions and the Gauss integral 


I grimt 


ear 


The use of the Feynman parameterization of loop integrals given in Eq. (A.27) leads 
directly to integrals that take the form, 


ae (02) 
J Or E- AFi (A.28) 


(A.27) 


In order to evaluate this integral it is easiest to perform a Wick rotation from 
Minkowski to Euclidean space. This is accomplished through the transformation, o > 
ilo, with the space-like components of £ unchanged. This is permissible because of 
the location of the poles in the original integrand in Eq. (A.28), which are at lọ = 
+(|¢\? + A — ie) where (? = @ — |¢|?. These are not encountered when rotating the 
real axis by an angle 7/2 (counter-clockwise) to the imaginary one. The Euclidean 
integral can then be parameterized by a generalization of spherical coordinates to D 
dimensions so that, 


/ d” = J dép lR sin”? 0p—1sin?™’? Op_2...sin 02 dOp_1d0p_2...d01, (A.29) 


where le = „4 + |£|?. Since the integrand is only a function of ¿g the angular 
integrations can be performed immediately by using 


s o 
[ dé sin” 0 = VT TG mz) (A.30) 


This result leads to 


de QinP/2 = 
lamer (Qn) PT(D/2) Jaee (A.31) 


The final integral over pg can be cast into the form of a beta-function integral, 


dlg pee Ng —1 n—k ; ee 1 nee S 7 
J Ce E i rea = ( l (A ie)P/? ry dzz k-D/2 al E x)P/ tk 1 
E 


(A.32) 
Using the result for such an integral given in Eq. (A.6), together with Eq. (A.31), one 
arrives at the identity 
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= (A — ie) PPek, 


d% (P) i(—1)" E [(D/2+k) T(n — k — D/2) 
| (27)? (L-A) (4r)P? T(D/2) r(n) 
(A.33) 


A.2  Spinors and spinor products 


A.2.1 A first representation 


There are different related techniques to evaluate spinor products. Here a fairly old ap- 
proach [197, 684] will be briefly presented, which is employed in the HELAS library [604] 
forming the basis of MADGRAPH [150] and in AMEGIC++ [696]. The starting point of 
all constructions lies in the fact that spinors such as u(p, A) and v(p, A), related to 
fermions or anti-fermions with mass m, momentum p and with definite helicity A obey 
Dirac’s equation of motion 


(p—m)u(p,A)=0, (p +m)v(p,à) = 0, (A.34) 


where ø is not necessarily Hermitean and m does not need to be real, while in any case 
p? =m? must be fulfilled. Additionally, the spinors fulfil the spin projection identity 


(1+7°¥)u(p,+)=0, (LFP, +) =0 (aBa 


for all polarization vectors s obeying s -p = 0 and s? = 1. This allows to construct 
massless chiral spinors w, spinors satisfying 


= 1+ 75 
= 2 


w(ko, A)w(ko, A) Ho» (A.36) 


for an arbitrary light-like four-vector kg, which will act as some kind of a “spinor gauge 
vector”. Spinors of opposite chirality w(ko, —A) can be constructed through 


w(ko, A) = A#,w(ko, —A) (A.37) 
with the vectors ko, satisfying 
k = 0, kok, =0 and k? = -—1. (A.38) 


Arbitrary, and potentially massive, spinors can be expressed in terms of these chiral 
spinors as 


p+m 
u(p, A) = -57 w(ko, —A) 
alee l (A.39) 
u(p, À) 9 = w(ko, A) 
P ` Ko 


The relations above also hold true for p? < 0 and imaginary m. 
For the construction of conjugate spinors the proper definitions 


0 


=uly and =v, (A.40) 


ad] 
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applied on the spinors obtained so far do not lead to the correct E.o.M. 
u(p, A)(p—m) =0 and v(p,A)(p +m) = 0, (A.41) 
if the mass is imaginary. Using instead 


u(p,t)\147°¢) =0 and o(p,+)147°4) =0 (A.42) 


and choosing the normalisation conditions 
u(p, A)u(p, A) = 2m and v(p, r)v(p, A) = —2m, (A.43) 
the conjugate spinors are constructed as 


d(p, A) = w(ko, —A) 


pre (A.44) 
V2p- ko 


This yields the following products for massless spinors 


v(p, A) = W(ko, —r) 


(piko)(p2k1) — (pik1)(p2ko) + icuvpo Pi p3 khk 
(pıko)(p2ko) (A.45) 
ūlpı, —)u(pe, +) = [ū(p1, +)u(p2, —)]“ - 


u(pı, +)u(pe, p2) = 


Terms of the form u(+, pi)u(+, p2) are proportional to the masses of the spinors. 
The such-defined spinors fulfil the completeness relation 


5 alp, A)u(p, ala O(p, A)u(p, à) (A.46) 
Xr 


This ensures that the identity 


ptm= ; 5 (: + 5) u(p, A)ū(p, A) + ( — “| u(p, A)0(p, A) 


à 

holds true, which allows to rewrite propagator numerators as spinor products. This 
allows terms of the form t(p,)#u(p2) to be rewritten by decomposing # into spinors, 
resulting in the spinor products of Eq. (A.45). In addition, terms of the form (uqy”u) x 
(üy u) are dealt with by employing Chisholm identities, yielding, again, spinor prod- 
ucts of the type tu from Eq. (A.45). Furthermore, a spinor representation of polar- 
ization vector for external particles may become necessary, unless they are explic- 
itly constructed and contracted into Lorentz-invariant scalar products or similar. To 
achieve this, first for massless vector particles like gluons or photons, it is clear that 
any representation must satisfy the identities 


(A.47) 
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Ex(p, A)p* =0 


x QuPv + QP A.48 
5 €u (p, A) (p, à) = Juv + a pq E, ( ) 


where q denotes an arbitrary, light-like four-vector not parallel to p. This can be 
achieved by 


€u(p, A) ula, A) UP, A). (A.49) 


o 1 

2\/Pq 
Similarly, for massive vector bosons, the polarization vectors must satisfy the com- 
pleteness relation 


* PuPv 
z, €u (p, AJE; (p, à) = —Jpv aN SS ; (A.50) 
A=+,0 p 


It can be shown that this relation can be obtained by writing 


Tee 
€u (p, à) => 8 2 u(q, A) Yu u(q2, à) (A.51) 
np pr=qf +45 


and integrating over the solid angle of qı in the rest-frame of p. The apparent problem 
is that this does allow only for unpolarized cross section calculations and the addi- 
tional integration renders the direct construction of polarization vectors potentially 
advantageous in terms of computing speed. 


A.2.2 Weyl-van der Waerden spinors 


A more convenient way to deal with spinors is encoded in the Weyl-van der Waerden 
formalism [878, 890]. Right- and left-handed chiral spinors in the D(4, 0) and D(0, $) 
representations of the Lorentz group are denoted, as usual, by undotted and dotted 
spinor indices, with complex conjugation connecting them: 


a = (Ya) and Y° = (y$) (A.52) 


Raising and lowering these indices is achieved by applying a tensor € given by 


Cab = E” Ses eb ee (A.53) 


There are two inner products in spinor space, for undotted and for dotted indices, 
namely 


(Cn) = Can? 
Ien] = Gant = (Cn)*. 


In the literature it has become customary to replace the spinors by their momentum 
argument or its label; for example a(k) = |k) and a(k) = |k]. 

With four-vectors residing in the Ds, 4) representation, they are constructed 
using two spinors and the four-vectors of Pauli matrices 


(A.54) 
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gt? = (og) and ot, = (o°, -2) (A.55) 


such that any four-vector k” can be written as 


ktk raat het hg 
kab = of; ky = & J , where 4l Lae (A.56) 


such that for massless vectors k2 = ktk~—. This allows to define spinors ¢(k) such that 


= ibs (A.57) 


kas = Calk)Cs(k), with cult) = ( 


and where ¢, = argk,. There are of course a few choices that can be made: first 
of all, the spinor representation above can be multiplied with a freely chosen total 
phase exp(20). In addition, of course, the choice of which axis defines the +-direction, 
is arbitrary; instead of using the z-axis it would be possible to choose another axis, 
corresponding to a rotation of the Pauli matrices. Irrespective of such details in the 
spinor definition, four-vectors are given by 


k! = o} CnC). (A.58) 


Massive four-vectors can be constructed by decomposing them into two massless ones, 
introducing yet another gauge degree of freedom. As a by-product of this, for massless 
vectors k; and kj 

2kik; = (ij) [ij], (A.59) 


so that a Lorentz product can be cast as a Dirac product. The fact that squared 
scattering amplitudes can always be expressed as Lorentz invariants translates into 
an independence on all choices made in constructing the underlying spinors, thus 
providing a welcome check of any calculation. 

External particles can then be represented in the following way: 


e Fermions are decomposed into left- and right-handed components, u+ = Psu 
with the projection operators 


14 
Prop = P4 = = : (A.60) 


Such chiral fermions with momentum p = (po, P), with signed three-momentum 
P = sgn(po)|p] and with the light-like four-vector p = (p, p) are represented by 


uom = bes (VBP 
i Vip] \ VPo +P x+( 
u_(p, m) = 1 ae +p x- (Ô 
v2 ( 


(A.61) 


N 
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_ 1 / vpo-Bx-(8) 
v+(p,m) = 2l ee 38) (A.62) 
ieee (VBP x00) | 
—(p, 2l VPo +P X+ (b) / 
where the Weyl spinors y+ read 
l 1 (pt\_( Jo 
x+(ô) = Jot & _ (Ee) (A.63) 


TAON 
X— p = Pt pt = —a/ pr . 
Since they are orthogonal and normalised to 29, the resulting Dirac spinors above 
are normalised to +2m. In most cases, massless particles are considered, which 


leads to a massive simplification of the Dirac spinors, since only the upper or 
lower components survive. In such a case, the short-hand notation 


us(k) =v¢(k) =|k*) and a4(k) = 0¢(k) = (k*| (A.64) 


has become customary. 


Polarization vectors for massless particles with momentum p are obtained in a way 
similar to the one already encountered by introducing a light-like gauge vector q 
and read 


Roe pe) 
Lig Gg) oa OEE (A.65) 
i v2 (qF| p*) 
Again, this identification yields the properties of polarization vectors and their 
completeness relation in axial gauge, cf. Eq. (A.48). 


Polarization vectors for massive particles will have a slightly different represen- 
tation; there the light-like gauge vector q is used to construct a vector p” = 
p” — p° /(2pq) q” and the regular transverse polarizations 


FIA pF) 
ei (p, q) = + calli - (A.66) 
= v2 (qF| p+) 
are augmented by a longitudinal one, 
do) = -= eE E i (A.67) 
yp? 2pq 


which then yield the right polarization sum of Eq. (A.50). 


The connection to the spinor formalism that has been introduced can be made trans- 
parent in a very straightforward way by realizing that 


lk*)(k?m| = =p, (A.68) 


Kinematics 647 


which is exactly the definition defining the massless spinors there; thus, up to a po- 
tentially different phase convention, the objects |k} and (k| are identical to the basic 
spinors ŭ and u. Therefore, the spinor products tu of Eq. (A.45) become 


(pip2)(kok1)[pako][k1p1] 
4(pıko)(p2ko) 
(p2ko) (kıpı)[pıp2][kokı] 
4(pıko)(p2ko) 


u(t, pi)u(-, p2) = 


(A.69) 


u(-, piju(t+, p2) z 


A.3 Kinematics 


In most cases, there is a special axis defined through the geometry of particle physics 
experiments, namely the beam axis — the axis parallel to the incoming beams. In 
most experiments (BaBar is a famous exception), this axis is uniquely defined.? Usu- 
ally, this beam axis is chosen to be the z-axis. In most cases, the position, where the 
beams are brought to collision, is pretty well known; this knowledge is used to fix an 
“origin” of the coordinate system. As long as the incoming beams are not polarized 
there is thus only one particular axis, and in such cases events exhibit cylindrical sym- 
metry w.r.t. the z axis. Usually the related azimuthal angle is denoted by ¢. Naively, 
then, other meaningful variables to determine momenta are the polar angle, typically 
denoted by 0, and either the energy or the absolute value of the three-momentum 
of the particle, where the ladder two are connected by the on-shell condition above. 
This set of parameters, say {p,0,¢} is particularly useful for lepton-lepton colliders 
where the longitudinal (w.r.t. the beam axis) momenta of the colliding partons — the 
leptons — are well known. However, for collisions involving hadrons this ceases to be 
true. since usually only some constituents of them, the partons, interact. In such cases, 
the energies and therefore the momenta of the incoming hadrons are known, but the 
energies and momentum fractions of the respective constituents that interact are not 
known a priori. Assuming that the initial partons of the process, the colliding hadron 
constituents, move in parallel to the incoming hadrons, this implies that the overall 
momentum of the colliding constituents along the beam axis is essentially unknown. 
One could then characterise their collision by their centre-of-mass energy and by the 
relative motion of their centre-of-mass system in the lab system. This relative motion 
can be understood as a boost of the constituent system with respect to the lab or 
beam system. Therefore, instead of using the polar angle @ in this cases it is more 
useful to have a quantity with better properties under boosts along the beam axis. 
Such a quantity is the rapidity, usually denoted by y. For a given four-momentum p, 


it is defined as 
E+ pz 


E-— Pz l 
It is simple to show that rapidity differences remain invariant under boosts along the 
z axis. To do so, it is enough to prove that rapidities change additively under boosts. 
Any boost is parameterized by a boost parameter y and by defining an axis. Energy 


1 
Y=5 log (A.70) 


2Even in BaBar, where the beams cross under an angle in the laboratory system, a boost (Lorentz 
transformation) can be applied, to find a system, where the beams collide “head on”. 
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and three-momentum along this axis (here for obvious reasons the z axis) then change 
according to 


E' = E cosh y — p, sinh 
| ey, (A.71) 
p, =pzcoshy — E sinh y 


and thus 
y =y- 7. (A.72) 


Unfortunately the rapidity does not provide a very intuitive interpretation, based on 
geometry. Therefore, another quantity has been introduced, called the pseudo-rapidity, 
commonly denoted by 7. Employing the polar angle 0, it is defined through 


0 
n = log tan = (A.73) 


It is worth stressing here that in the limit of massless particles, their rapidities and 
pseudorapidities coincide. On the other hand, for massive particles, a finite rapidity 
y may be achieved my a mere boost along the beam axis, leading to an angle 0 = 0 
w.r.t. this axis and hence an infinite pseudo-rapidity. 

Having thus characterized the longitudinal component of the momentum, only the 
transverse component needs to be described — which is typically achieved by quoting 
its absolute value p, and the azimuthal angle ¢. For massless particles therefore the 
four-momenta can be written as 


p” = p, (coshn, cos ¢, sin d, sinh n), (A.74) 
while for massive particles 
p” = (m, coshy, p1 cos ¢, p, sind, m] sinhy), (A.75) 
in terms of the transverse mass 
m =p +m’. (A.76) 
This also allows to rewrite the Lorentz-invariant phase-space element of one particle 


as follows: 
dp  _ pidpidyd¢ 
 2E(2r)3 22r)? 


(A.77) 


A.3.1  Light-cone decomposition 


From here, the transition to light-cone variables is fairly straightforward. They are 
constructed by defining two momenta P} and P- — in hadronic collisions they are 
typically given by the incoming beams. Orienting them along the z axis, and assuming 
symmetric collisions, they read 


P4 =(E,0,0, +E), (A.78) 
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where the projectiles’ masses have been neglected. Then the total hadronic centre-of- 
mass energy squared can be expressed as 


S=2P,P_. (A.79) 
Then, any momentum p” can be decomposed as 
p” = aPt + BPH + pE, (A.80) 


where a and ĝ are the plus and minus components of the momentum, respectively. 
The rapidity of p is given by 


1 E+p, 1 1 
log Piri log 2+ = log 


a 
= . A.81 
tO ag. 2 p 28 (Apl) 
In addition, 
p? > abs —pi, (A.82) 
which, together with p? = m? allows a or 8 to be eliminated through 
m +p  m+p, m+ mp 
= = y = 1 = L ey A 
a BS g~e or £B aS ge (A.83) 


In a scattering process pı + p2 > p3 +--+ pn, four-momentum conservation then 
translates into 


n 
Qı + Q2 = Q1 = > Qi 
i=3 


Bi + Bo = b2 = 5 Bi (A.84) 
i=3 


n 
PL1 +82 =0 = Sins 
i=3 


where it has been assumed that the two incident partons pı,2 move along the positive 
and negative z axis respectively, implying that they have zero transverse momentum 
and that ag = $, = 0. This also allows to identify a, and (2 as the light-cone 
momentum fractions the partons carry with respect to the incoming hadrons. It is 
customary to identify these with the respective Bjorken-x, 


zı =a, and x2 = fo. (A.85) 


Appendix B 
The Standard Model 


B.1 Standard Model Lagrangian 
B.1.1 Constructing the Standard Model 


B.1.1.1 Gauge invariance: U(1) 


The Standard Model (SM) of particle physics is arguably the most successful model 
in physics to date, explaining practically all known phenomena on sub-nuclear-length 
scales with only 19 parameters, which are being determined at ever-increasing preci- 
sion. The construction of the model rests on one paradigm, namely gauge invariance. 

The idea is the following: global phase invariance is the invariance of a La- 
grangian under phase transformations of its fields. As an example consider the La- 
grangian for a simple massive Dirac fermion w without interactions, 


£ = b(id—m)v, (B.1) 
which is invariant under transformations parameterized by a single real phase 0, 
y — y =e and y — yY = pe”. (B.2) 


The Dirac fields above play the role of matter and are therefore usually also called the 
matter fields. Their invariance under the transformation actually guarantees that 
they have associated, conserved charges. 

The gauge principle introduces interactions to this Lagrangian by postulating that 
the Lagrangian remains invariant even if the phase depends on space-time z”, 0 = 
0(x), or, in other words, that the Lagrangian is also local phase or gauge-invariant. 
Naively, however, this is not the case, since 


PY = MO GY + O (iJo) WF OO Pd. (B.3) 


and the second, additional term must be compensated for. This is achieved by defining 
a self-compensating gauge-invariant derivative, 


D, = ô, — ieA, (x) (B.4) 


where A,,(x) is a new, additional field that transforms as 


A (z) — A,(x) = Ay(x) + O] (B.5) 


652 The Standard Model 


As a consequence 
(Poy = Db (B.6) 


and similar for the 7. The new field(s) A, are the gauge fields. Through their intro- 
duction enforced by the gauge postulate of local phase invariance the Lagrangian is 
modified and reads 


£L = 4 (iP-m)yY = Y (i+ e4- m) y. (B.7) 


As a consequence, an interaction emerged in the previously free Lagrangian, namely 
between two of the Dirac spinors and one of the gauge fields. Dynamics of the new 
gauge field A,, is generated through adding a kinematic term, which in general has the 
form 


Leaugë = Was D] = (3 Av as Oy An) (OV A” T. o” A!) (B.8) 


where the subscript “g” refers to gauge. Simple inspection shows that on this level, 
without any further fields introduced, mass terms such as 


2 
Lom = = A, AH (B.9) 


violate gauge invariance and are thus forbidden. 


B.1.1.2 Non-Abelian groups: SU(N) 


In the previous example, the gauge transformations of Eq. (B.2) are effected by real 
numbers 6, the phases. These transformations form a group with elements labelled by 
a continuos index, the phase. This structure is known as a Lie group; the algebra of 
its generators consists of exactly one element, 1, and in turn the gauge group from 
the example above is known as U(1). 

Of course more complicated gauge transformations are also possible by arranging 
the matter fields in multiplet structures Y, and labelling the y with some index i: 


Y = (Wn, Pa,- Yn)’. (B.10) 


Global and local phase transformations then mix the various components; this is 
achieved through 
Y — V = exp(id’r") Y (B.11) 


or, in component notation, 
pi — Yi = [exp (16°7")],, Yi = Ui Vi, (B.12) 


exhibiting the fact that the generators 7, in fact are n x n matrices in the space of 
the indices 7, as is their exponential. The phases 0a may or may not depend on z, 
depending on whether the transformations above are local or global. 

As a consequence, the gauge-invariant derivative reads 


Du = Op — igt® A(z) = On — igAy. (B.13) 
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In order to ensure gauge invariance, the gauge fields now transform as 
i 
Aye) A (x) = U(x) A, (x) Ut (a) + me) Ut (x) (B.14) 


and the gauge-invariant derivative transforms as 
D (£) — Di, (£) = U(x) D(x) U+ (x). (B.15) 


In all cases the gauge fields must be massless in order to guarantee gauge invariance. 

The only thing necessary to fix now is the gauge group. In the case of the SM it 
is given by a direct product of three groups, namely SU(3), ® SU(2)r @u (1)y. 

The subscript c of the first group SU (3)e stands for “colour”, and it is the strong 
interactions which are susceptible to colour charges. These interactions are enjoyed 
by the quarks and mediated by the gluons. The most fundamental representation 
of this group is by arranging the individual quarks in triplets with a colour index 
running from 1 to 3, i € [1, 3] such that a quark field q is given by three spinors w,,; 


Wy = (Hei, Va, 03) - (B.16) 


The three “colours” i that the quark fields can carry are often also denoted as “red”, 
“blue”, and “green”, a reminiscence to the first day of colour television. The anti- 
quarks of course carry anti-colour quantum numbers. The interactions mediated by 
the gluons are able to change the colour of a quark. They are related to the eight 
Gell-Mann matrices \° so that in the case of SU(3), T° = A°/2 where the latter 
are given by 


010 0-i 0 1 0 0 

à = 100], = 700], =| 0-10], 
000 000 000 
001 0 0-1 000 

Ag = 000], As={| 000], Ag={[ 001), (B.17) 
100 i0 0 010 
00 0 100 

A7 = 10-i)], AX=+lo010 
Oi 0 a 0 0-2 

and satisfy the commutator relation 
[Aa, Ab] = i fabcAc- (B.18) 


Furthermore, the A are Hermitian and traceless, a property that usually is shared by 
generators of a group. One of the invariants of this group is given by the Casimir 


operator, 
nee 4 
= bp 2 y 2 _ 
Cr = : Ta = 4 2 AS = 3° (B.19) 
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This finishes the quick summary of the properties of the generators of SU(3) in its 
fundamental representation. 

The fae are the structure constants of SU (3); they are completely anti-symmetric 
in the three indices and are given by 


fi23 =1 


fiaz = fies = foas = f257 = fas = f376 ; (B.20) 


fass = fers = lo 


The structure constants form the adjoint representation of the group — in this 
repesentation the generators T4, are matrices of dimension 8 x 8 given by 


ik = tfaik- (B.21) 
The corresponding Casimir operator is given by 


Ca= X TT =3. (B.22) 


a 


In general for SU(N) the generators in the fundamental repesentation are (n? — 1) 
matrices of dimension n x n, and in the adjoint representation there are (n? — 1) 
generators of dimension (n? — 1) x (n? — 1). 

The other group structure relevant for the SM is SU(2)z, acting on the left-handed 
spinor fields. For this group the fundamental representation puts the left-handed 
spinor fields in corresponding doublets, with weak isospin charges of +1/2 for the 
upper and lower field. The generators are given by Ta = 0/2, where the Pauli matri- 


ces Oa read 
0 1 0—i 1 0 


and enjoy the commutation relation 


[Oa, Ob] = l€abe Sc- (B.24) 


Here, the structure constants are the completely anti-symmetric Levi-Civita symbols. 


B.1.1.3 Standard Model before electroweak symmetry breaking 


As indicated earlier, the SM of particle physics is built from fermionic matter fields that 
are subjected to gauge invariance with respect to the gauge group SU (3)e x SU(2)r, x 
U(1)y. The matter fields come in three generations, labelled with a generation index 
I. Fermions are chiral, they are left- or right-handed; the decomposition of Dirac 
fermions into chiral ones is achieved with projectors such that 


1445 


p+ h (B.25) 


1-5 
2 


Y = YL + Yr = Pry t+Pry = 
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Table B.1 The matter fermions of the Standard Model, with the corre- 
sponding charge assignments. The charges fulfil Q = T; + Yw /2. 


Fields SU(3)- | SU(2)r: Ta | UA)y: Yw Q 
(1) 1 2 
QP _ UL i Cr +3 pt +3 
Li 7 
i -; aa 
ud? Cr 0 +4 +3 
a Cr 0 "TE 
I 
(2) vy) +3 0 
Lii = H 0 1 -1 
lzi 2 —1 
ee, 0 0 2 -1 


where P? = Pr, and P? = Ppr as well as Pr, + Pr = 1. Each generation contains a left- 
handed doublet of quarks, yee a colour charge 7 in the fundamental representation, 

D = (ul), dDYT), Here, u and a) denote left-handed up-type and down-type 
quarks, respectively. Each generation also includes a similar left-handed doublet of 
leptons, containing a neutrino yo) and a charged lepton D, LD = (VW, KDT, 
The presence or absence of colour charges indicates that the quarks enjoy the SU (3) 
interaction, while the leptons don’t. In an equivalent way there are also fields that do 
not take part in the left- anes perae a in each generation there are re handed 
colour-charged quark fields ul K and dv and a right-handed lepton field eu ”) There 
are no right-handed neutrinos in the SM. In Table B.1 all matter fields of the SM are 
listed, including their charges: the third component of the weak isospin T}, related to 
SU(2),, and the weak hypercharge Yw, related to the U(1)y, as well as the electrical 
charge Q, which only emerges after breaking the symmetry SU(2), x U(1)y down to 
U(1)g. The charges are related through 


Yı 
Q= + 2 (B.26) 


Defining the gauge-invariant derivatives through their action on the various fermion 
fields as 
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Ag. coe Y; 
I x i a i Qa a : Ww I 
D, es > (a, + igs Gu bap + 192 - Wi bi + gi z Pe dudas) ele 


AG ees 
Due: = (a, F 193 7 Gi 191 5 Bi ôs) ul, 


A2, Yı 
I a AG na , a Yw I 
Didy, = (a, + ig3 T Gi, + ig 5 Bu ds) ae 
(1) a eS E 5 LD 
Dili a = u + 192 2 Wi + 291 2 By Sap LB 


Yi 
De = (a, + in ZB, a), 
(B.27) 


where the À and ø are the Gell-Mann and Pauli matrices, respectively, labelled by 
a € [1, 8] and a € [1, 3]. The colour and weak isospin indices i and j and a and 8 
have been made explicit here. The gauge fields are the eight gluons Gi, the three weak 
isospin bosons W$, and the weak hypercharge field B,,. The gauge-invariant derivatives 
enter the Lagrangian of the SM before electroweak symmetry breaking (EWSB) as 


Lsm = L matter T Leauge 


3 
Lmatter = > [OP PAP + af Du? + dy pap + EP PLP + Op pe 


I=1 
L£ = — lge gan _ lwa waw _ 1p pw 
gauge 47u g w 47e ’ 


(B.28) 


where the summation over the colour or weak isospin labels a is understood and where 
the non-Abelian generalization of Eq. (B.8) yields for the field strength tensors 


Ge, = 3 GS — O,G% + igi fG GS, (B.29) 


introducing self-interactions of the non-Abelian gauge fields G/, and W$ into the gauge 
part of the Lagrangian above. 

Note that in principle also gauge-fixing terms £,.5, would have to be added, which 
could further neccessitate the introduction of Fadeev—Popov ghosts. These unphysi- 
cal degrees of freedom manifest themselves as Grassman scalars, scalars with fermionic 
behaviour, which carry the gauge quantum numbers of the gauge fields. 


B.1.1.4 The need for electroweak symmetry breaking 


Analysing the structure of the Lagrangian in Eq. (B.28) in some detail exposes a phe- 
nomenological shortcoming of the SM: all particles introduced so far must be massless. 

As already discussed, the gauge fields must be massless, since direct mass terms of 
the form m? A? violate gauge invariance. Applying, for example, the gauge transfor- 
mation of Eq. (B.5) on the B field would result in 
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2 1 
B, BY — BiB’ = BBY + PA + za u8)(0"8) + BB". (B.30) 


At the same time the fermions also cannot have a mass term. The reason is that such 
a term for Dirac fermions w has the form 


LDirac,mass = mapp =m (vrdy + LYR) 7 (B.31) 


As long as left- and right-handed fermions transform in the same way, the respective 
phase factors would of course compensate; this is the case in QED where the phase 
transformation acts on all components of the spinors in the same way. Clearly, on 
the other hand, in the very moment left- and right-handed fermions do have different 
gauge transformations — as is the case in the SM, manifest for example in Eq. (B.27) 
— there is no guarantee that this compensation happens. As a consequence, the mass 
term above is explicitly gauge-violating, as it triggers an uncompensated phase factor 
stemming from the SU (2)z transformation acting on the left-handed spinors only. 

At the same time, masses for the weak gauge bosons - the WE and Z° bosons and 
for all fermions are well established. This means that either the underlying construction 
paradigm of the SM, gauge invariance, does not hold true or that the SM in the form 
presented so far is not complete and needs to be supplemented with a mechanism that 
allows the generation of mass in a gauge-invariant way. As it turns out, the latter 
option is realized in nature. 


B.1.1.5 The Brout—Englert—Higgs mechanism 


The Brout-Englert-Higgs (BEH) mechanism solves the problem of gauge-invariant 
mass generation in a spectacularly elegant way, by essentially hiding the local phase 
symmetry through the introduction of a non-symmetric vacuum. Before discussing 
this in some detail, consider first the question of which of the gauge groups of the SM 
is the critical one. Following the reasoning above, it is clear that the gauge bosons 
of the SU(3)., the gluons, are massless, and that there is also a massless U(1) gauge 
boson, the photon. On the other hand, the three weak gauge bosons are massive, and 
fermion masses are disallowed because of the SU (2)z invariance. It is therefore natural 
to concentrate on the SU(2); part of the Standard Model Lagrangian. 

In the BEH mechanism, this proceeds by introducing a complex scalar ® = 
($+, ¢°)", which is coupled to the SU(2)z x U(1) part of the SM through a gauge- 
invariant derivative, 


a ee ees 
Dy®g = (a bs + ig2 7 We + in B, 503) ®,, (B.32) 


where, again, the a and 6 label the weak isospin components of the doublet ®, dt 
and ¢°. The relevant quantum numbers of ® are T3 = +1/2 for the upper and lower 
components and Yw = 1/2. 

Including a potential for the doublet, the original Lagrangian of Eq. (B.28) is 
supplemented with two new parts, namely 
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Lu = (D,®)'(D“®) + pote — (610)? (B.33) 
and peer T 7 
Lur = — ful QP uh — f1 OP Odh — fe" LP Oly, (B.34) 


for its Yukawa interactions with the fermions, which actually contains both left- and 
right-handed fermions. Here, u? and A are real numbers, while the f’7 are arbitrary 
matrices in generation space. In addition, 


® = io’ G, (B.35) 


essentially swapping the position of the ¢* and ¢° components in the Higgs doublet. 
The new, complete SM Lagrangian is given by 


Lsm = Lmatter T Leauge + Lu + LHF, (B.36) 


where all gauge and fermion fields are still massless. 
Analysing the form of the Higgs potential in more detail, and determining its 
ground state through minimization, leads to the condition 


—p? +200 = 0 (B.37) 
or 7 9 
t SIH ee 

(DD iS ae (B.38) 


for the expectation value of the fields in the ground state. Identifying the ground state 
with the physical vacuum gives rise to the notion of the vacuum expectation value 
v of the fields. If 4? and are both real positive numbers this leads to an infinite 
number of equivalent vacua, forming a hypersphere with radius v/v2 in the space 
spanned by the doublet. Note that choosing u? > 0 also means that the ® doublet 
does not have a physical mass term in the Lagrangian, the corresponding term « u? 
just has the wrong sign. 

As the quantization of the fields proceeds by expanding around a single vacuum, 
for instance by using creation and annihilation operators in canonical quantization, 
one of these vacua must be picked.' Without any loss of generality, it has become 
customary to define the vacuum state of the Higgs doublet to be 


= (2), (B.39) 


leading to a reparameterization of the fields as 


P = 6 — (ð) = (oča) l (B.40) 


lOnce such a vacuum has been picked, the system cannot tunnel out of it as there are infinitely 
many equivalent states. 
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Rotating the three W bosons and the corresponding generators by introducing charged 
states 


= 1 228 
Wi? We = a (Wi Fiw) (B.41) 

and, correspondingly, 
do = o! tia = (i i) G a (B.42) 


essentially the ladder operators of SU (2). 
This allows to parameterize the Higgs doublet as 


l 0 
va) = ep |-* D swr] | ven |= oO Ry Ba 
i=+,3 V2 


with x = (0,1)? and where the spatial dependence of the four real fields n(x) and 
&;(x) has been made explicit. These four fields of course have no vacuum expectation 
value, 

(v)o = (&i)o = 0. (B.44) 


A convenient way to see how the breaking of electroweak symmetry (EWSB) pro- 
ceeds is to chose the unitary gauge, fixing the Higgs doublet to have the form 


V+En(L 0 
p = unitary = U(E)® = CEA = (aa) j (B.45) 
V2 


v2 


This essentially means that the Higgs doublet has only one remaining visible field left, 
n, while the three phase fields €; have been rotated away and must be re-introduced in 
the various parts of the Lagrangian — which is achieved by the set of transformations 
in Eq. (B.46), 


f 


E wir) -v@ |E wir] omo + = (avo) ume 


1=+£,3 1=+£,3 
Bl =B, (B.46) 
Y} =U(E)Ut 
Uh =Vp. 


Taking a closer look at the gauge transformation for the W fields in the first line, 
it becomes apparent that the three € fields that have appear to have vanished as 
dynamics degrees of freedom from the Higgs sector have resurfaced as parts of the W 
bosons through the 0,U(é) term. This term in the end results in fields 0,,€, which 
together with the derivatives in the kinetic term of the W gauge bosons from a kinetic 
term for these new fields. It turns out that they indeed are the massless Goldstone 
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bosons, postulted by the celebrated Goldstone theorem [589, 590] as a consequence 
of a broken gauge group. Ultimately, however, they will be “eaten” by the gauge fields, 
absorbed by another gauge transformation. This mechansim provides the gauge bosons 
with a third polarization degree of freedom and in turn turns the hitherto massless 
gauge bosons into massive particles. 

Ignoring this part of the Lagrangian, consider the further terms emerging from 
the transformation above. Starting with the simplest bit, the Higgs potential, the 
invariance under this kind of transformation is clear — the phases just trivially cancel 
out, since 

(16) — (Pİ) = GIUT(E)U(E)G = (S10). (B.47) 


The Higgs potential therefore reads 


2rv? 
> n? — Avn? n* + const. (B.48) 


LH, pot = 


with the Higgs field 7 as the only dynamic member. Also, this field has acquired a 
mass term with the right sign, leading to a physical mass of 


my = vV22. (B.49) 


From the transformations in Eq. (B.46) it is straightforward to see how this works 
with the kinetic term of the Higgs doublet as well. From 


(D,®) = U(E) D,* (B.50) 


it is simple to see that again the phases will just cancel out. The (D,,®)'(D“®) term 
expressed through the transformed fields looks like 


Luin = (DEN (DHE) = Losin + Lm + Lr (B.51) 
and depends only on the vacuum expectation value v, and the dynamics degrees of 


freedom given by the Higgs field 7 and the gauge fields Wi. and B,, where, for conve- 
nience, the primes are omitted. They are given by 


1 
Lyin = 5 (Onn) (O"N) 
= Vv 


2,2 2 T 2 T 
92 r—-yrt: v Bu gi = —9192 By ) 
£ W wre + — 
w 4 a 8 (a) EA g2 ) (i 


1 
=miy WW! + 5m ZZ" 


m? mŽ 
Lip ur (n? + 2un) W7 WH + a (n? + 2un) ZZ", 


with the new fields A, and Z,, — the photon and the Z boson — emerging from the 
diagonalization of the mass matrix in Lm as 
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Ay = sin Ow W; + cos Ow B, (B.53) 
Ly = cos Ow W; — sin Ow B,- 


The photon is massless, m4 = 0, and the masses of the charged W~ and the neutral 


Z? boson are given by 
mw = = and mz = 5 V9 +93, (B.54) 


fixing also the weak mixing angle or Weinberg angle as 


tandw = A or cos Ow = ZW n = (B.55) 


92 mz Vtg 


It is a tedious but straightforward exercise to show that the kinetic term of the gauge 
fields as well as their interaction with the fermions is invariant under the gauge trans- 
formation of Eq. (B.46). 


B.1.1.6 Dealing with the fermions 


This leaves only the Yukawa interactions of the fermions with the Higgs doublet. Em- 
ploying the same transformations as before, Eq. (B.34) expressed through the trans- 
formed fields becomes 

LHF = Utn ee Pricer oe, Æ fiS qD aD $ JET EDD (B.56) 

V2 u Up UR d GLOR ESS ERO 

where for convenience again the primes over the fields have been omitted. This however 
leaves an interesting problem: since the Yukawa matrices fu, a, are arbitrary, there is 
no reason to assume that they are diagonal, leading to non-diagonal mass terms of 
the fields. Insisting on well-defined particle masses — which is sensible! — this means 
that the fields must be rotated in such a way that the mass matrices are diagonal, for 
example 


(1) 
IT __, V2 mă ld 
v 


(B.57) 


giving rise to in total nine fundamental coupling strengths of the Yukawa couplings 
of the Higgs fields to the fermions, namely the mP, mP, and mP, It is one of the 
non-trivial predictions of the SM that the Higgs boson couplings to the fermions are 
directly proportional to their masses. 

This sounds very good, but there is a somewhat unwanted consequence of this 
diagonalization process. Before the diagonalization, the interactions of the fermions 
were diagonal, obviuosly they cannot stay that way. In other words there are two 
bases for the fermions, a basis given by the mass eigenstates of the fermions and 
another one, given by their interaction eigenstates. In order to see how they connect, 
take a closer look at how this diagonalization proceeds. The starting point are the 
arbitrary mass matrices VET summarily denoted as M. Such general matrices can 
be diagonalized by a bi-unitary transformation, 
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Maiag = S'MT, (B.58) 


where S and T are unitary and Mgjag is diagonal with non-zero eigenvalues. In addition, 
every matrix M can be written as a product of a Hermitian and a unitary matrix, H 
and U, 

M = HU, (B.59) 


and in general MM by construction is Hermitian and positive. As a consequence, 
0 0 

S'(M'M)S = (M?)diag — m2 0 J. (B.60) 
0m 


Up to an arbitrary phase in the diagonal elements, © is unique, so that also 


S'FI(MIM)FS = (M? diag, (B.61) 
where ; 
ér 0 0 
F={[ 0 ë 0 |. (B.62) 
0 0 é” 


These phases will come back in the context of CP violation but here this freedom in- 
deed guarantees that m? > 0. The Hermitian part H of the decomposition in Eq. (B.59) 
can be identified with 
H = SMaiagS', (B.63) 
which fixes U as 
U = H-'M and Ut = MH. (B.64) 
The hermiticity of H and the unitarity of U are simple to confirm. Using Eq. (B.63) 
this means that 
Maing = SHS = S'MUtS = SIMT (B.65) 
which also defines the matrix T in Eq. (B.58), T = U'S. With this reasoning, the 
mass terms are diagonalized through 


dr Mbr = (LSS MT)(TiYr) = Y, Maiagt'r- (B.66) 


In other words, the left-handed fields will transform with S, while the right-handed 
fermions will transform with T. Looking at the interactions with the gauge bosons 
effectively leads to three different structures. First of all, there is the interaction of the 
right-handed fermions, which will assume the form 


PRYOR = VEY TIT OE = OE exw ye = PEYR. (B.67) 


This means that the right-handed quarks can be rotated to their mass eigenstate 
without any obvious consequence for their dynamics. The same reasoning also applies 
for neutral interactions of the left-handed fermions, for their interactions with the 
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gluons and the B and W® or, equivalently, the photon and the Z boson. The only 
difference is to replace the unitary matrix T with the unitary matrix S. However, for 
their charged interactions with the W~ this is not true anymore. There, the fermion 
current becomes 


abytd, = UEP Si Sande = Kv ae, (B.68) 
where the Cabibbo-Kobayashi-Maskawa matrix (CKM matrix) 


Ven = S$ Sate (B.69) 


has been introduced. This matrix mixes the quark generations in interactions, where 
a W boson couples to an up-type and a down-type quark, and ultimately allows the 
quarks of the second and third generation to decay weakly into the first generation. 

As a product of two unitary matrices, the CKM matrix itself is unitary, in principle 
with n? = 1 free parameters. Using the arbitrary phases in the matrices S, it becomes 
clear that (2n — 1) phases can be removed by redefining the quark states. In total 
therefore the CKM matrix has n? — (2n — 1) = (n — 1)? = 4 free paramters, 3 free 
angles and 1 phase. This gives rise to the Wolfenstein parameterization, where 
the Cabibbo angle A ~ 0.22 is the small evolution parameter, and, up to third order 
in À 


Vua Vus Vub = F j A ( E in) 
viCKM) _ | Va Vas Va | = =a -X A% . (B-70) 
Via Vis Vi 
aca Ad3(1 — p— in) —AX? 1 


The further parameters of this parameterization are given by 
Aw 0.8, p ~ 0.135, and ņ ~% 0.35. (B.71) 


It is this phase 7, which actually introduces CP violation into the SM. 


B.2 Feynman rules of the Standard Model 


The Feynman rules of the SM are obtained from the Lagrangian introduced above via 
the usual techniques of quantum field theory, that may be found in one of the many 
standard texts. In short, one considers the action of the theory which is related to 
the Lagrangian by 


sai f dec), (B.72) 


In the free (non-interacting) theory this action leads to two-point functions whose in- 
verses represent particle propagators. The interaction terms in the Lagrangian combine 
particle fields of different types that are represented by vertices. 
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B.2.1 Propagators 


Fermion propagators arise in the SM through the two-point functions represented by 
terms in the Lagrangian such as those shown in Eq. (B.7). Recalling the momentum- 
space replacement 0,, —> —ip, leads to the corresponding propagator factor, 


Ga R RAUA (B.73) 
pP -m 
where the inverse is easily obtained. 

The bosonic propagators are slightly more complicated to obtain except for the 
Higgs boson, for which the scalar propagator is trivial. Consider first the W and 
Z propagators that correspond to the gauge terms in the Lagrangian (Lgauge) Of 
Eq. (B.28), together with the mass terms generated by the BEH mechanism, £m 
of Eq. (B.52). These generate two-point functions, for instance between Z fields, of 
the form, 


Z, [(P? — m3)” — pp] Zv. (B.74) 
The inverse of the factor in square brackets yields a propagator of the form 
1 pp” 
si : B.75 
pP? = m3 (o my, ee) 


In the limit mz — 0, corresponding to the case of the photon and gluon, the tensor 
in Eq. (B.74) is not invertible. This necessitates the introduction of additional gauge- 
fixing terms to the Lagrangian. For instance, for the photon one can add the term 


i 
2 


where € is an arbitrary parameter. This leads to the Feynman rules shown. Note that 
the choice € = 1, called the Feynman gauge, is often the simplest choice since it leads 
to fewer terms at intermediate stages. Although this is the end of the story for the 
photon, in a non-Abelian theory the gauge-fixing term introduces unphysical degrees 
of freedom that must be cancelled by ghost contributions. These contributions are not 
discussed further here. 

The Feynman rules for all of the propagators of the SM are shown in Fig. B.1. 


Let. = (O"A,)*, (B.76) 


B.2.2 Interactions of gauge bosons with fermions 


The interactions of the gauge bosons with the fermions of the SM originate from 
Eqs. (B.28) and (B.56). The interactions that do not involve the Higgs boson corre- 
spond to the covariant derivatives appearing in Eq. (B.27), after accounting for the 
effects of the BEH mechansim and the modifications due to the CKM matrix indi- 
cated in Eq. (B.68). The Yukawa interactions of the fermions with the Higgs boson 
are manifest already in Eq. (B.56) and can be simplified slightly by identifying the 
value of the vacuum expectation value through v = 2my/gw. 
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Fig. B.1 Propagator Feynman rules in the Standard Model. 
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Fig. B.2 Feynman rules for fermion-boson interactions in the Standard 
Model. 
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In order to see an example of this in practice, consider the pees for the 
covariant derivative acting on the left-handed lepton doublet, D, LẸ D of Eq. (B.27). 


Lia 
Rewriting the Pauli matrices g“ in terms of the generators 7 and ‘the Wi, Ws fields in 


terms of W=, cf. Eq. (B.41), one obtains 


DL, = (a, + ig a" ri Wt + ga za Wg +r Wg+| + in YB, 503) Se 

(B.77) 
where T= are defined analogously to o~, cf. Eq. (B.42). Expressing the fields wè and 
B,, in terms of the photon and Z-boson fields, cf. Eq. (B.53), and using the weak- 
hypercharge relation in Eq. (B.26) reduces all the fields to physical ones, 


D i= (2, + e Wi +r as Wa + i[g2T; sin 0w + g1 (Q — T3) cos Ow] Ay 


+ i [g2Ts cos Ow — gı (Q — Ts) sin 9w] Z, bap) I; (B.78) 
The final simplification is obtained by relating the couplings gı and g2 through the 


weak mixing angle, cf. Eq. (B.55), and identifying the electromagnetic and weak cou- 
plings, g2 > gw and e = gw sindw, 


pie = (ð, $ T © frip Wit + 139 Wg] + ieQA, 
Seon [Ts - Qsin? Ow] Zp bap) LY. (B.79) 


From this expression it is straightforward to read off the Feynman rules that are shown 
in Fig. B.2. Similar manipulations of the other covariant derivatives can be performed 
to reproduce the remaining Feynman rules shown. 


B.2.3  Self-interactions of gauge bosons 


The self-interactions of the gauge bosons of the SM are generated by the non-Abelian 
contributions to the term Leauge of Eq. (B.28). The corresponding Feynman rules can 
be derived in a straightforward manner by substitution of the corresponding expres- 
sions for the field-strength tensors, cf. Eq. (B.29), accounting for identical-particle 
factors of 1/n! where appropriate. 

In QCD these terms lead to the three- and four-point vertices shown in Fig. B.3. 
Note that the sign of the three-point vertex is sensitive to the direction of flow of the 
momentum; the rule in the figure corresponds to all momenta outgoing (signified by 
the outward-pointing arrows). 

Turning to the electroweak sector, the interactions are again obtained by rewriting 
the field-strength tensor in terms of the basis of physical fields, Wires Zu and A,,. Thus, 
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Fig. B.3 Feynman rules for boson self-interactions in QCD. 
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This leads to the Feynman rules for self-interactions of electroweak bosons that are 
shown in Fig. B.4. The universal Lorentz structure of the interactions can be seen 
immediately from Eq. (B.80), so that the figure summarizes several interactions at 
once using the definitions, 


Cy =e, Cz = gw cosby, (B.81) 


The cubic and quartic interactions that involve the Higgs and other electroweak bosons 
can be read off from the contribution £; shown in Eq. (B.52). The self-interactions 
of the Higgs boson are a result of the potential term introduced in Eq. (B.48). The 
strength of the interactions can be simplified using the implicit expressions for v and 
A in terms of the electroweak coupling and boson masses given in Eqs. (B.49) and 
(B.54). 
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Fig. B.4 Feynman rules for self-interactions of electroweak bosons. In 


this figure vo represent either a photon or a Z boson, with the couplings 
defined by cy = e and cz = gw cos Ow. In the interactions involving Higgs 


bosons, V is either a Z or W boson of the appropriate charge. 


Appendix C 
Catani—Seymour subtraction 


C.1 Catani-Seymour subtraction for NLO calculations 


In the following, kinematic maps for the construction of real subtraction terms, the 
corresponding splitting kernels and their integral will be listed. In all cases, massless 
partons only will be considered. 


C.1.1 Final-state splitter — final-state spectator 
The kinematic maps p;i + pj + Pk — Pij + Pe such that 


pit pj Pk = Pij + Pr (C.1) 
and 

ts Yij,k 

Pij TPR ae: a Pk 
— Vij,k 

i 1 (C.2) 

Pk = Pk, 
1 — Yij,k 


where the recoil parameter y;;,, and the splitting variable Z; are given by 


pe = PiPj 
a PiPj + PjPk + PkPi (0.3) 
a PiPk DiPk i 5 ' 
R= = — and 2 = 1—%. 


(pi + Pj) Pk DijPk 


With their dependence on these two parameters understood, the splitting kernels 
read (CS-5.7-CS-5.9) 


2 
1— 34(1— yij,k) 


(8|Vaigsskl8’) = 8ru” Cras | (1+ 2;) — e(l — 2| Oss! 


Diy. : À; i 
(u| Vug;kle) = Sru” Tras | g” po" Žjp;)" (ea = 3) 
iPj 


1 1 
(ulVo.gjsnl¥) = 16r” Chas | at ( = z 2) 
Iig. 1 — 2;(1 — Yijk) 1— 2;(1 — Yij,k) 


l-e, P X Ma D 
+ (Zipi — Žjpj)" (ipi — Žjp;) . (C.4) 
PiPj 
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Note that here the flavours of the particles resulting from the splitting are indicated 

as subscripts in the kernels V. The flavour of the spectator does not matter, while the 

spin states of the splitter do. They are therefore given as arguments in the () brackets. 
The spin-averaged kernels are denoted by (V) and they are in this case given by 


(Vaig3,k) is z 
— a 1+ 4; 1— 4; 
87g p= Cr 1— 2(1 — yij,k) ( = ) e %) 


Wain) Sie i- aa] 


2 


8T UE as l= 

(Vzig;;k) | 1 1 ” te 

ee 2C'4 = t = 2 + Zi 1 Ži 
8T UE as La ye) 1— (1 — ži )(l — Yij,k) ( ) 


These are the terms that form the basis of splitting kernels in the construction of 
parton showers, see Section 5.3.2. 
Integrating the spin-averaged kernels over the phase-space y = yijk and z = Ži 
yields 
1 


vole) = fapa- faya- ayeye ee (ce) 
0 


Bray p7~ 
0 


where the extra terms in z and y stem from the phase-space integral over momenta 
rewritten in these quantities, cf. (CS-5.16-CS-5.21). 
The resulting integrated dipoles are given by 


1 3 n? 
Vaglé) = Cr È Fag T 5 i 010] 
2 16 
11 50 x? 
vale) =2Ca l= E R- +06] 
In general they can be written as 
1 T? 1 
le = T? pay) Sea EK l 
ve) = (4 - 5) HPG (+0), (C8) 


where the q; are the usual first-order anomalous dimensions of the splitting functions 
Eq. (2.33), and the K; read 
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The last term is the usual soft gluon correction term. 


C.1.2 Final-state splitter — initial-state spectator 


The kinematic maps pj + pj + Pa > Pij + Pa such that 


Dit Pj —Pa = Pij — Pa (C.10) 
and 


Pij = pit pj — (1 — Tija) Pa 


5 (C.11) 
Pa = (1 = iza) Pas 
where the recoil parameter Tija and the splitting variable Z; are given by 
_ PiPa + PjPa — PiPj 
iha (pi + pP 
Se (C.12) 
% =Le = Pia and = 1-3 


(pi + pj)Pa DijPa 


With their dependence on these two parameters again understood, the splitting 
kernels read (CS-5.39-CS-5.41) 


2 
1- 2(1 — Lij,a) 


(Vit, [6) = 874 Cras | (ney =a z| fig 


(u| Vga P) = 8r u?” Tras | gt” (2,0; zp En- 305)" | 


iPj 


1 1 
a = 16ru” Caas |—g'” i = 2 
P S A | i (G — (l — Tija) 1- 3j(l-— Tija) ) 


l—e 
+ Žipi — 255)" (Zi — 25D; F 
E lipi = 5305)" Gip — 303) 

(C.13) 


Apart from the replacement Yijk — %j;,q these kernels are identical to the ones in 
Eq. (C.4), and also the splitting parameter 2; can be related to the one in the case 
of a final-state splitter with a final-state spectator, with the obvious replacement of 
Pk — Pa. It is therefore not a surprise that the spin-averaged splitting kernels (V) for 
the FI case here can be obtained from their FF counterparts in Eq. (C.5) by replacing 
Yij,k with Tija 

The integration of these terms over the emission phase-space with the replacement 
L = Tija and z = %, yields, cf. (CS-5.50) 


1 


1l+e Fi x tg 
Vij(@, €) = O(i;,2)O(1 — Tija) (=) fera je (Vij,k( s Yigk)) 


1 — Tija Brasu? 


(C.14) 
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and here the first big difference with respect to the case of both splitter and spectator 
being in the final-state becomes apparent. As the recoil parameter Tija is not being 
integrated over, for € — 0, the term « 1/(1 — Zij,a) before the integral diverges at 
Tija, a Singularity, which cannot be lifted without taking care in how the two limits, 
€ — 0 and Tija — 1 are approached together. The usual solution for this kind of 
problem is to invoke a “+” function such that 


Vij (Zija, £) = Vi tijas e)l + o(1 — Lij,a) J džV;; (f, £), (C.15) 
0 
where 
Vis (tija ly = Vis (@iz,a, 0), + O (8). (C.16) 


Therefore, cf. (CS-5.57—CS-5.59), 


1 2TR 
Vag(z, €) glk (=). + ô(1-— zx) io + = + O(e) 
2 1 11 1 2 
Vog(x, €) =2C 4 (ps). 5 Can + cs log(2 ») 
11C 
+612) [vuo - F] + 00), 
(C.17) 
with the V;; given in Eq. (C.7). 
C.1.3 Initial-state splitter — final-state spectator 
The kinematic maps pa + pj + Pk Pai + Pe Such that 
Pi + Pj —Pa = Pij — Pa (C.18) 
and 
Dai = Tij, a Pa 
á m (C.19) 


Dk = Pr + pi — (= Tija) Pa- 


It is worth stressing that in this configuration the initial-state splitter actually keeps 
its direction along the beam axis, with a momentum reduced by Tija, which now plays 
the role of a splitting parameter. At the same time, the transverse momentum recoil is 
transferred to the spectator which, consequently, changes its direction. The splitting 
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variable Zik a parameterizing this behaviour is given by 


tia = Ee + PiPa — PiPk 
y (pi + Pk)Pa 


(C.20) 


There is, however, another parameter, u;, which is being used to decompose the 
eikonals into two splitter-spectator dipoles: 


PiPa 
us = ———. C.21 
(pi + Pk)Pa ( ) 


With their dependence on these two parameters again understood, the splitting 
kernels read (CS-5.65-CS-5.68) 


2 
Ht Oba, | E E vine)| Su 


s yei 
(81M) l — Tika + Ui 


(s| Vn 


s') = 8r u Tras i — E — 2tik all — 7] Öss’ 


1— Tika 2u;(1 — Uj) 


Vik,a PiPk 


1 
1 ik,a(l — ik,a 
l — Tika + Ui Pa i, ) 


(HVET |e) = 8m4 Cros |g tino 4 iat (0.22) 


(uve |v) = 16r42*Caay |0" ( 


1 — Tika 2u;(1 — ui) | 
+(1-6)-— hate 
ve) Tika ppp rE 
where 7 i 
Pi P 
die = Pk (C.23) 


Indicating the number of polarizations of particle a with n,(a), the spin-averaged 
kernels in a suitable normalization read 


ns(3) (V°) 2 

eS = 1 t tk,a 1 ik,a 
ns(q) 8Tasu?E P [L= tika +u PEE RT ate) 
ns(q) Vee) =T 1 2Tik,a(l aa Tika) 
ns(g) 8Trasu?E = l-e 
ms(9) (V°) 1 — Tika 

= 1 8 a ee 
ns(q 87a pire we ( Ne Leo Vika 
Ns g V39 1 1— Tik,a 
(9) Vi a = 2C4 || l a4 2 1+ Tik,all — Tik,a)| , 

Ns(g) 8TasH 1 — Tika + Ui Lika 


(C.24) 


cf. (CS-5.77-CS.5.80). 
As in the FI case, the integration over the emission phase-space does not include 
the parameter Tika, governing the kinematic map for the initial state. Therefore, this 


674 Catani-Seymour subtraction 


time the integrated dipoles emerge from an integral over u; only, cf. (CS-5.74) 


yria, £) tu O(x)O(1 — x) (=) fausa — u;)] E ratai) (Vij k (Ži, Yijk)) 
0 


s(a) 87s L7* 


(C.25) 
As before poles emerge which this time are of the type f(x)/e with f(x) is either 
integrable over x or proportional to 1/(1 — x). Therefore the integrated dipoles are 
written as (CS-5.75) 


T 


1 
; 1 J1 ; , 
ya lr, e) = > | [ex VO pe) |e + eô(1-— x) pase, of. (C.26) 
0 
which upon expansion in £ ultimately leads to a structure of the form 
A 1 
vinle, e) ~ wile) + x (1400) +00, (6.27) 


where the ptf are related to the Altarelli—Parisi splitting functions from Eq. (2.33) 
through 


p(x) = Poq(2) 

p(x) = Pag(2) 

p(z) = [Paglt)]y (.28) 
p(x) = [Pa (a), — 2C4 + 6(1 -= x) 70, 


cf. (CS-5.85-CS-5.88). This yields the integrated dipoles for the IF case, 


V9 (x, e) = |-2 Sta 2) p(x) + Cra + O(€) 


V(x, €) = |-z + log(1 — 2) pI (x) + 2Trr(1 — x) + O(e) 


V4 (a, €) = —= pata) + 6(1—2) Yao fe > 7 s)| 
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4 1 2 
l log(2 
(e), ag oe v) 


42 (-1 a EE —) eaGt= o) (C.29) 


The first terms œ 1/e in each dipole are related to the collinear divergence in the 
initial-state, while the terms œ V;j(e) in V%%(a, €) and V99(x, €) capture the soft 
divergences stemming from the eikonals and are related to the emission of soft gluons. 


C.1.4 Initial-state splitter — initial-state spectator 


In the case of both the splitter and the spectator in the initial-state an interesting 
problem emerges. In Catani-Seymour subtraction, there are two paradigm for the 
construction of the kinematics maps between the (n + 1)—particle and the n—particle 
configurations, namely 


(a) to keep the spectator direction and to just stretch it; and 
(b) to keep initial-state particles on the beam axis. 


This works very well for splitter partons in the final-state. In particular, in the case of 
FI dipoles, this was natural — the spectator momentum keeps its direction and merely 
is stretched to compensate for recoil in its longitudinal direction. In so doing of course 
the initial-state particle stayed oriented along the beam axis. For IF dipoles both 
paradigms above already could not be maintained anymore. In fact there this confilct 
was solved in such a way that the spectator accounts for the balance of transverse 
momenta, This becomes even more aggrevated in the case of II dipoles, where both 
splitter and spectator are initial-state partons. In this case, the solution is to let the 
complete final-state, apart from the emitted parton i, of course, capture the transverse 
momentum. This is achieved by defining a boost for all final-state momenta k; 4 pi 
in such a way that four-momentum conservation in the (n + 1)-particle configuration, 


pi + wh — wi — ST kG = 0, (C.30) 
j+i 
is maintained also for the n-particle configuration: 
Be, + ph — X ke = 0. (C.31) 
j#t 
Here, 


Pai = Ti ab Pa 


z, — PaPe — Pi(Pa + Po) (C.32) 
iab Z 4 
i PaPb 


As can be seen, the momentum of the spectator parton pp is not altered in this map, 
but instead all final-state momenta are modified to read 


676 Catani-Seymour subtraction 


5 2h (K+K) o au 2K py 
ane eae (K+K) ae (C.33) 


where the momenta K and K are the total momenta of the dipole before and after 
the mapping onto Born-level kinematics has taken place: 


KY =p% + p — p} 


a (C.34) 
KY = phi + Ph 
The splitting kernels are given by 
2 
OVAS s’) = Sru” Cras = = (1 + Xi,ab) = e(1 a a) Oss! 
— Ti ab 
s|V9etib]s') = 8ru Tras |1— E — 2%; abl — £i ab) Sea" 
H : i 
= 1- ia 2 a v 
(uV Elv) = 8ru Cras [=a 20 $ Ahab Mabb gig (C.35) 
Liab PiPa PiPb 
1 
(pV 999°? |v) = 16ru Chas | gh” (; | Trall =) 
— Ti ab 
1— Gü 2 a v 
Hie Viab  4PaPb a 
Ti,ab PiPa PiPb 
where p 
qh = pt — = ph. (C.36) 
PbPa 


With the notation concerning incident spins already used in the IF case, the phase- 
space integration, cf. (CS-5.153), 
1 T(1—e) 

e T(1 —2e) 


-2e Ms(aé) (V) 
nla) 8TA E 


yaai(g, e) = Q(x)O(1 — x) (1 — 2) (C.37) 


of the spin-averaged splitting kernels yields integrated dipoles of the form 


Va(x, e = V(x, €) + 6? T? 


2 1 oe ee 
(she). Toy 108? J tK 
(C.38) 
up to Ø (e), see also (CS-5.155). Here, the V(x, £) are the integrated dipoles in the 
IF case, cf. Eq. (C.29). Not surprisingly, in the II case thus additional terms emerge. 
The K® read 


K(x) = P&®)(x)log(1— x) + ôT 


(e a). 
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The regular bits of the splitting functions, pies) (x) are the remainders after the poles 
for  — 1 have been subtracted, namely 


PE) (2) = Pala) — 6% 


a 


oT (=) + yo — J , (C.40) 
= + 


where both the splitting functions and their anomalous dimensions y are given by 
Eq. (2.33). In particular they read 


pees) (x) = P(x) foraA#b 
Pree = —Cr (1 +2) (C.41) 
pire) (x) =2C4 —* —l+2(1-2)]. 


Note that the spin-averaged splitting kernels of course effectively are the Altarelli— 
Parisi splitting functions P in D dimensions, 


ng(ai) (V° (x)) 
Ns(a) 8Tasu E 


= P ale) (C.42) 
from Eq. (2.33). 


C.1.5 Final formulae 


After rearranging the various integrated dipole terms and adding the collinear counter- 
terms the part of the cross-section to be added to the virtual part reads 


doto) = do B™) (pa, po) ® Ie) 


1 
+5 J dedol y (apa, P) ® [K (x) + P (apa, i n) 
a’ 0 (C.43) 


1 
+> f drdoly ™ (pa, 1P) & [x (x) + P” (app, 2, n) ; 
b/ 0 


see also Eq. (3.202) and (CS-10.27)-CS(10.30). The integrated dipole terms are 


Os 
I 
(e) mI- e) 
; 4ru? E 4 E Anuz E 
yee) Seren (Fe) +n (SH) +n a (ZE) 
T G Nee 2PiPk 2PiPa 2PiPb 
Vale) ( Arp? | it ) 
+ Ta T; + Ta 
T? pS E \ Qpapr ° \ 2papo 
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Amp? \* Amp? \* 
Dan ( Ae ) + Ts ( a ) : (C.44) 
7 2P Pk 2PbPa 


while the collinear counterterms are given by 


Vole) 
toe 


aa’ As —aa’ aa T; è Ta ~ aa! 
, Ti T, 1 
gee a a i ER 1 = 
i > T? z,“ al) 
aa! . 2) — Os p(t) Ti: Ta me Ty + Tx [Mp 
EES Siig) Qn Para(®) 2 Tj, 2TPaPi t-72 2xpaPo | 
(C.45) 
where 
qq 1+2? l-z 
K” =Cr G= log 7: D (1— x) — ô(1— x) (5 *) 
1 1- 1- t= 
K” =204 ( log z) +( 2 1+ a(1— 2) ) tog = 
1 = x a x x 
50 16 (C.46) 
— 6(1— 2) lcs (F-*) -g rns 


sa T= 
KY =P log — + Cra 


= i 
K” =p) log 


The terms K contain the regular part of the splitting functions pies) defined in 
Eq. (C.40) and listed in Eq. (C.41), 


K%(2) = PE® (x) log(1— x) + 6% T2 


a) 5 (l ol. (C.47) 


C.2 Catani-Seymour subtraction for parton showers 


In the following, the various building blocks for a parton (dipole) shower based on 
Catani-Seymour (“CS shower”) splitting kernels for massless partons will be reviewed. 
In principle there are four cases to consider, namely final- (initial-) state splittings 
with spectator partons in the final- (initial-) state, leading to FF, FI, IF, and II cases, 
respectively. There are typically four ingredients: 


e kinematical variables, parameterizing recoil, and splitting; 


e their relation to the transverse momentum, k], the default ordering parameter of 
the dipole shower, and the corresponding Jacobean J; 
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e the splitting kernels; 
e the phase-space map between four momenta before and after the splitting, denoted 
by p and p. 

These ingredients in the original proposal of [778] and in the implementation of [839] 
will be listed here. It should be noted, though, that in some cases the actual imple- 
mentation of the CS shower in SHERPA version 2 differs from the one described here, 
mainly due to the resummation property described in the main test, in the relevant 
part of Section 5.3.2. 


1. FF (final-state splitter with final-state spectator): 


e kinematical parameters: 


PiPj PiPk 
Yijik = and zi = = 1— zj; (C.48) 
a PiPj + PiPk + Pjpk í PiPk + PjPk 3 


e transverse momentum: 
ki = a(l zik, (C.49) 
where Q? = sijk = (pi + pj + pk)? = (Bij + Pk)”, and 
JEP) = 1 — yay. (C.50) 


Therefore, the relevant phase-space element (including the propagator-like 
denominator) is given by 


1 dk? dz; do k2 
a ae a 1 L x .51 
et 167? k? zi(1— zi) 27 ( z(1— 5a) oe 


splitting kernels: 


(FF) 2 

Kye =Cr | ts ) 
(FF) 1 1 

K —2C + 2+2(1-% 
ane: i E = zil + Yiz) T= 2zi)(1 + Yij;k) ae ) 


Co =a f -24 (1 — =| ; 


(C.52) 
e phase-space map (in the emitter-spectator c.m. frame): 
Pi = ziPij + (1 2i) Yijik Be + ka 
pi = (1— zi) Bij + Zi Yij;ik Pk — kı (C.53) 


Pk = (1 — Yij;k) Dr- 
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2. FI (final-state splitter with initial-state spectator): 


e kinematical parameters: 


Tija = PiPa + PjPa — PiPj EE PiPa zine: (0.54) 
PiPa + PjPa PiPa + PjPa 
e transverse momentum: 

1 — ay. 
k = all- z) i Q, (C.55) 

Tij;a 

where Q? = (pi + pj + pa)? = (Pij + Ba)”, and 
Na 2 ) 

jen _ Tel aia HF) (sti (C.56) 


fash (na: 1) 


where the PDF ratios typical for backwards evolution with na the momentum 
fraction of a are made explict and where the transformation from an integral 
over Tij;a to one over k? cancels the 1/(1— Tij;a) term stemming from the 
one-particle phase-space integral written as a function of 2;;.,; 

e splitting kernels: 


L = 1+ 2 
ane F [T= z+ = Zija) ey 
1 1 
KED = 2C } 2+ 2;(1 A 
ae “|i at (1+ Zija) | 2% + (T= Tija) ae) 
Ke = TR h -— 2z; (1 — z| : 
(C.57) 
e phase-space map (in the emitter-spectator Breit frame): 
Fe 1— 2; l — Zijia ~ T 
Ppi = ZiPij ( i 2 a + ky 
Lijsa 
a Zi 1 Tija) ~ pe 
Pj = (1 a zi) Pij aN, ( 7 ) ki (C.58) 
Tij;a 
ES 
Pk = — Pk 
Tij;a 


3. IF (initial-state splitter with final-state spectator): 


e kinematical parameters: 
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Pobi ZL rag ee eG) 


Tajik = 
PaPj + PaPk PaPj + PaPk 


e transverse momentum: 


ki = uj(1 — uj) —“ Q, (C.60) 


where Q? = (pi + pj + Pa)” = (aj + De)”, and 


1 1— ua Aja (25, n2) 


JOP) = (C.61) 
Zijia 1 — 2ua faja (na: 3) 
where as before suitable PDF factors have been included; 
e splitting kernels: 
2 
KEE =C 1+ Zaj: 
a P [1 — Zajk + Ua ea) 
1 — 243. 
KUE) =Cpr E Vaj;k + zajit! 
Ag Laj;k i 
? 9 1 (C.62) 
KEE) = 2004 | TE aaa 
ggk 1— Lajik + Ua Lajsk sik isk) 
Kiar = TR f — 2taj;k(1 — ras) 
e phase-space map (in the emitter-spectator Breit frame): 
ie 
Pa = Paj 
Laj;k : 
1 — Zajk ~ s 3 
py = (1— ua) is Paj UaPe + kı (C.63) 
Laj;k 
1- Lajik ~ 3 > 
Pk = Ua Paj + (1— ua)ðk — kı. 
Laj;k 


This map stemming from a trivial inversion of the original Catani-Seymour 
phase-space map, however, is not optimally suited for parton shower simula- 
tions, since here the transverse momenta act on the colourful spectator alone 
and not on the total final-state. This is at odds with standard resummation 
ideas as discussed in Section 5.3.2. Therefore, other phase-space maps have 
been proposed in [626, 807], which remedy this situation. Essentially, these 
maps transmit the kų onto the initial-state splitter, which in turn results in 
a Lorentz transformation that puts the splitter back onto the beam axis at 
the expense of the complete final-state being subjected to some transverse 
momentum “kick”. Following the more specific [807], this map is given by 
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first constructing the four-momenta according to 


l> ua - UL Ba a 1 > 
Pa = Pa t > + kı 
Taj;k — Ua T Laj;k — Ua Ua — Laj;k 
1 — Zajk u l-u 1 > 
_ jik_ > a z 
Pj = Baj + > Be + — k (C.64) 
Laj;k — Ua T Laj;k Ua Ua — Laj;k 
r Taj;k — Ua ~ 
Pk = ~~~ Pk 
Taj;k 


In fact, these two phase-space maps are related to each other by the Lorentz 
transformation A“, (K), 


AH (K) = git , Taj;k kf kiv i uall = Tajik) K”K, 
E ” (1> ua)(l-— tajik) BajPr Tajik — Ua PajPk 
i Taj;k kí K, —K¥kiy, 
Laj;k — Ua PajPk 
(C.65) 


with K = Daj + Pk- 
4. II (initial-state splitter with initial-state spectator): 
This is, in some sense, a special case, since the spectator momentum is preserved 
and, consequently, any recoils are captured by the final state. In particular, this 
leads to a phase-space mapping for the emitter of 


Paj = aj;bPa and py = Po, (C.66) 
where l 
Tajib = PaPb — PaPj — PPj l (C.67) 
PaPb 
Then, further building blocks are given by 
e transverse momentum: 
1 — Zaj:b — Vi 
k = ae R, (C.68) 
Laj;b 
where Q? = (pa + pj + P)? = (Baj + By)”, and 
PaPj 
v; = —. C.69 
i PaPb ( ) 
The Jacobean reads: 
Na 2 ) 
JED = 1 ie Vij;a — Vj Faa (a HF ; (C.70) 


e splitting kernels: 
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2 
KED =c | - 1+ #430) 
qg,k 1= tags ( J; ) 
2(1 a 
KID _ o, A ai) aaa 
qqk Taj;b 4 
i‘. 7 (C.71) 
KY) = 20 | Taib L1 (1 Ta 
gg,k 1 — Tajib Taj;b jbl j b) 
II 
Eo = TR h — 2Taj;b(1 mr tai) ; 
phase-space map (in the emitter-spectator Breit frame): 
— 1 A 
Pa = Tajik Paj 
1 — Zajk — vj = > . 
p = E  baj + vibe + ki ee) 
Laj;k 
Po = Do 


and all other (FS) momenta k; are suitably boosted and rotated by a Lorentz 
transformation A 


kj = A(Paj + Pe, Pa + De — pj) kj (C.73) 


given by 


g(t KK + Ky KK (C.74) 
(Kk + Kk)? K2? 


AR (K, K) = gf 
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